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© Synthetic urogastrone gene, corresponding plasmid recombinants, transformed cells, production thereof and 
urogastrone expression. 

© A synthetic gene characterised in that it codes for the 
expression of urogastrone or a sub-unit thereof is disclosed. 

The production thereof by the assembly and ligation of a 
number of nucleotide blocks is also disclosed, as are 
corresponding plasmid recombinants, transformed cells and 
the production thereof. 

The expression of urogastrone is further disclosed. 

Urogastrone is a polypeptide hormone (protein) which 
may be isolated in small amounts from human urine. It has 
an application in the treatment of ulcers and in the promo- 
tion of wound healing. The present invention provides a 
more viable commercial production. 
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"Synthetic urogastrone gene, corresponding plasmid 
recombinants , transformed cells, production thereof 
and urogastrone expression" 



This invention relates to a synthetic urogastrone 
gene, to corresponding plasmid recombinants and transformed 
cells, to the production thereof and to urogastrone expression 

Urogastrone is a polypeptide hormone (protein) 
synthesised in the duodenum and in the salivary glands of 
normal humans, (see, for example, Heitz, et al, (1978), 
Gut, 19 , 408-413) . Urogastrone suppresses". ; the secretion 
of gastric acid and promotes cell growth, (see, for example. 
Elder, et al , (1975), Gut, 1£, 887-893). Therefore, it has 
an application, in the treatment of ulcers and in the 
promotion of wound healing. Urogastrone is excreted in 
small amounts in human urine and may be isolated therefrom. 
There exists, however, a need for a more viable commercial 
production thereof and such is provided according to the 
present invention - 

Urogastrone is known to consist of 53 amino 
acids in the following sequence: 
Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp - 
Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr - 
lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys - 
25 Val Val Gly Tyr He Gly Glu Arg Cys Gin Tyr - 
Arg Asp Leu Lys Trp Trp Glu Leu Arg 

(see, for example, Gregory, H. , and Preston, B.M., (1977), 
Int. J. Peptide Protein Res., 9_, 107-118.) 

From the above amino acid sequence, a 
30 corresponding synthetic gene sequence has been invented, 
subject to a number of specific non-obvious criteria, and 
oligonucleotide blocks synthesised which, when assembled, 
form a synthetic gene coding for urogastrone. The blocks 
have been hybridised and ligated in pre-determined stages 
35 to construct the urogastrone gene in two portions. These 

have been cloned in two operations into a new specifically- 
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designed chimeric E. coli/ S. aureus vector so as to 
produce a full length urogastrone gene flanked only. by 
E. coli plasmid DMA. The gene has been excised from 
this recombinant and re-cloned into vectors specifically 
designed to maximise expression of the gene in E. coli, 
under the control of the promoter obtained from the 
E. coli tryptophan operon. A protein resembling human 
urogastrone has thus been expressed in E. coli. 

From the above amino acid sequence, because 
of the degeneracy of the genetic code, it is possible 
to predict numerous nucleotide sequences which would 

code for the protein. 

In the inventive determination of an optimum 
sequence from the large number of possibilities, several 
non-obvious criteria have been observed. Firstly, 
trinucleotide codons should be used which are acceptable 
or preferable in the cells to be used, in particular 
E. cbii . secondly, it was decided that it was desirable 
to have different restriction enzyme recognition sites 
3 at the termini of the molecule so as to allow insertion • 
into a plasmid in a desired orientation. . Moreover, it 
was decided to select sites which allowed the use of 
well-understood cloning vectors, such as pBR^322 (see, 
for example, Bolivar, F. , et al , (1977), Gene, 2, 95-113). 
25 In fact, Hin d . Ill and Bam. HI sites were selected and 

introduced at the 5' and 3' ends, respectively. Thirdly, 
it was thought desirable to introduce a series of 
restriction endonuclease recognition sites strategically 
placed along the molecule to enable the gene to be 
30 specifically disected to aid characterisation and, 

possibly mutagenesis. Also, this measure allowed the 
two portions of the molecule to* "be cloned in stages. In 
particular, an Xba I site was introduced at a central 
location in the gene. Fourthly, the synthesis should 
35 not be unnecessarily complicated and illegitimate cross- 
hybridisations should be minimised in order to facilitate 
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gene assembly. Using a computer, energies of interactions 
arising from all possible approximations were calculated 
(see, for example, Tinoco, Jr., I., et ajL , (1971), Nature, 
230 , 362-367; Powers, G.J., et al , (1975), JACS, 97, 
5 875-889) , so that stable off-diagonal interactions might 
be avoided where possible. Fifthly, since the protein 
ultimately expressed in bacterial cells will be in the 
form of a fusion product, it was desirable to have a means 
of cleaving the urogastrone portion from such a fusion 
10 product. Since urogastrone is known to be insensitive to 
trypsin (see, for example, Gregory and Preston, loc cit) , 
the codons specifying the dipeptide lys-lys were introduced 
near the end of the gene corresponding to the urogastrone 
N-terminus in order to serve as a substrate for tryptic 
1 5 pro t eo 1 y s is . 

The present invention relates to a synthetic 
gene characterised in that it codes for the expression 
of urogastrone or a sub-unit thereof. . 

The particular preferred sequence selected for 
20 the coding portion of the synthetic urogastrone gene is 
as follows: 

5' AATTCCGATAGCGAGTGTCCTCTG- 
3 1 TTAAGGCTATCGCTCACAGGAGAC- 
AGTCACGATGGTTACTGTCTACATGACGG- 
25 TCAGTGCTACCAATGACAGATGTACTGCC- 
CGTCTGTATGTATATTGAGGCTCTAGACA- 
GCAGACATACATATAACTCCGAGATCTGT- 
AGTACG CGTGTAATTGCGTTGTTGGCTAC — 
TCATGCGCACATTAACGCAACAACCGATG- 
30 ATCGGTGAGCGCTGTCAGTATCGAGATCT- 
TAGCCACTCGCGACAGTCATAGCTCTAGA- 
GAAATGGTGGGAACTTAGA 
CTTTACCACCCTTGAATCT 

One aspect of the present invention is directed 
35 accordingly. A sub-unit of such a sequence also constitutes 
an aspect of the present invention. 
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In fact, bearing in mind, inter alia, the 
above-mentioned considerations, for convenience, a 
slightly longer sequence was selected which is as 
follows: 

Coding: AGCTTAAAAAGAATTCCGATAGCGA GTT 
Non-Coding: ATTTTTCTTA'AGGCTATGGCTCA 
GTCCTC-TGAGTCACGATGGTTACTGTCT- 
CAGGAGACTCAGTGCTACCAATGACAGA- 
ACATGACGGCGTCTGTATGTATATTGAG- 
TGTACTGCCGCAGACATACATATAACTC- 
GCTCTAGACAAGTACGCGTGTAATTGCG- 
CG-AGATCTGTTCATGCGCACATTAACGC- 
TTGTTGGCTACATCGGTGAGCGCTGTCA- 
AACAACCGATGTAGCCACTCGCGACAGT- 
. GTATCGAGATCTGA'AATGGTGGGAACTT,— 
CATAGCTCTA'GACTTTACCACCCTTGAA- 
A G A T A A G 
TCTATTCCTAG 

The whole of this expanded sequence is shown in 
Figure 1 of the accompanying drawings and the restriction 
enzyme recognition sites are shown in Figure 2 of the 
accompanying drawings • 

Another aspect of the present invention is 
directed accordingly. A sub-unit of such a sequence also 
constitutes an aspect of the present invention. 

In the above sequences, it is to be appreciated 
that minor variations are possible without departing from 
the present inventive concept.. 

The present invention also relates to a process 
.for the production of such a synthetic gene or a sub- 
unit thereof characterised in that it comprises the 
assembly and ligation of a number of oligonucleotide 
blocks. 

It was in fact decided to synthesise a molecule 
having the above expanded sequence by making 23 synthetic 
oligonucleotide blocks as illustrated in Figure 3 of the 
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accompanying drawings, which will assemble by single- 
strand overlaps to give the complete double-stranded 
nucleotide sequence . 

In order to minimise strong illegitimate 
interactions near the centre of the molecule during 
assembly, certain blocks of modified sequence have also 
been synthesised, as illustrated in Figure 4 of the 
accompanying drawings. 

As mentioned above, the synthetic blocks 
selected are shown in Figure 3 of the accompanying 
drawings. The blocks may be constructed using known 
synthesis techniques (see, for example, Agarwal, et al , 
(1970), Nature, 327 , 27-34; and Crea, et al, (1978), Proc. 
Natl. Acad. Sci. USA , 2£, 5765-5769). 

The synthetic methods will now be illustrated 
with reference to the synthesis of the tetradecanucleotide 
ApGpTpTpCpCpCpApCpCpApTpTpT . 

The methods of building up oligonucleotides 
from smaller units by successive coupling reactions are well 
known (see, for example, Hsiung, et al^, (1979) , Nucleic 
Acid Research, 1371-1385) . The completely protected 
tetradecanucleotide was built up as shown in Figure 
5 of the accompanying drawings (wherein, for convenience, 
protecting groups are not shown) . 

The condensation reactions indicated by arrows 

in Figure 5 were carried out by the following procedure 

B z iBu 

exemplified in the synthesis of: DMTr A p G p-CNEt 

• • 

0C1 0C1 

as shown in Figure 6:* of: the accompanying drawings.. From- ; 
* 1.1 to 1.5 mmole of the 3 '-phosphodi ester component (I) 
was condensed with 1.0 mmole of the 5* -hydroxy 1 component 
(II) in anhydrous pyridine in the presence of from 3 to 
4.5 mmoles of 2,4,6-triisopropyl-benzene-sulphonyl- 
tetrazolide. The reaction was left for 1 hour at room 
temperature or until chromatography on silica TLC plates 
eluted with 10% (v/v) methanol in chlorofrom showed that 
the hydroxy 1 component was exhausted. The reaction was 
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quenched with 5% (w/y) sodium bicarbonate solution and 
extracted with chloroform. The chloroform extract was 
dried and loaded onto a reverse phase chromatography 
column (ODS bonded to 15-25 micron silica) . The fully . 
protected dinucleotide product (III) was eluted with a 
solvent gradient from chlorof ornumethanol : water (2:6:3 v/v) 
to chlorof orm:methanol: water (2:6:0.5 v/v). The product 
(III) was extracted into chloroform and dried. The 
final isolated yield was 81%. 

To proceed to further condensations, the 
terminal protecting group (DMTr or CNEt) was removed 
selectively using triethylamine in pyridine (CNEt) or a 
2% (w/v) solution of benzene sulphonic acid in chloroform: 
methanol (DMTr) as shown in Figure 7 of the accompanying 
15 drawings. 

At the completion of the synthesis, all of the 
protecting groups were removed by sequential treatment 
with 0.1 M tetraethylammonium fluoride in THF/pyridine/ 
| water (8:1:1 v/v), ammonia and 80% acetic acid. The 

* 20 • deprotected oligonucleotides were purified by ion-exchange 

HPLC and sequence analysis was carried out by the method 
of Wu, et al, (1976), Anal. Biochem., 74, 73-93. 

The oligomeric blocks of nucleotides .were 
hybridised and ligated (see, for example, Agarwal, et al, 
loc cit) in a series of steps, in order to minimise the 
possibilities for undesirable interactions, leading to 
the formation of the two portions as shown in Figure 8 
of the accompanying drawings. The order of the additions 
in the assembly scheme was-, optimised for minimal incorrect 
ligations and in the case of especially difficult oligo- 
meric blocks, notably 7 and 8, sub-molar quantities were 
used in order to remove all monomeric units before further 
additions were made. 

In more detail: 

35 Left-hand portion: Blocks 1 and 2 were ligated 

to form a dimer about the Hind III site. Blocks 4, 5 and 
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6 and 8 r 9 and 10 were also ligated in the first round 
of the assembly scheme. Molar equivalents were used for 
all but block 8 where 0.75 molar equivalents were 
employed. 

Block 3 was ligated with 1 + 2, block 7 
(0.75 molar equivalent) with 4 to 6 and block 11 with 
8 to 10. The 8+9+10+11 assembly has one flush end, 
hence some blunt-end dimerisation was observed. 

1 to 3 and 4 to 7 were ligated and finally 8 
to 11 were ligated to the resulting 1 to 7 species . 

The diraeric 1 to 11 left-hand portion was then 
cleaved by Hind III (EC 3.1.23.21) and Xba I (EC 3.1.23.4) 
to generate the monomeric left-hand portion, with the 
correct cohesive termini to allow construction of 
recombinant plasmids. 

Right-hand portion: Blocks 12 and 13 were 
ligated to form a dimer about the Xba I site and blocks 
20, 23 and 22 similarly ligated to form a: dimer about the 
Bam HI site. Blocks 14, 15 and 17, and 16, 18 and 19 were 
also ligated at this stage. 

The 12, 13 dimer and the 14, 15, 17 assembly 
were ligated, as were the 16, 18, 19 assembly with the 20, 
22, 23 dimer, where block 21 was used as a joining section. 

These two species were then ligated to give an 
oligomeric molecule which was cleaved by Xba I and Bam HI 
(EC 3.1.23.6) to give the monomer 12 to 23 species having 
the correct cohesive termini to allow construction of 
recombinant plasmids. 

The present invention further relates to a 
plasmid recombinant characterised in that it comprises a 
plasmid vector having inserted therein at an appropriate 
insertion site such a synthetic gene or a sub-unit thereof, 
the plasmid recombinant enabling translation. in the 
correct phase for the raRNA corresponding to the inserted 
gene or sub-unit thereof and having a* bacterial promoter 
upstream of and adjacent to the insertion site such that 
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the inserted gene or sub-unit thereof is under bacterial 
promoter control. 

A particular vector, designated pLFJ., has 
been specifically designed to afford advantages for the 
purposes of the present invention. The inventive plasmid, 
pLF 1, is a 5K bp plasmid which may be propagated in 
E. coli and which may be constructed from pBR_322 and 
pUB 110 (see, for example, Gryczan, T.J., et al, (1978), 
J. Bacterial., 134, 318) by inserting the DNA sequence 
10 of the S. aureus - plasmid pUB^HO between the EcoRI and 
Bam HI sites thereof (comprising approximately 870 bp) 
between the EcoRI and Bam HI sites of pBR^322, thereby 
replacing that region of pBR^322." 

The present invention further relates to a 
15 process for the production of such a plasmid recombinant 
characterised in that it comprises inserting such a 
synthetic gene or a sub-unit thereof into an appropriate 
insertion site of an appropriate plasmid vector. 

The following illustrates the present 

20 invention: 

Phosphorylation of oligonucleotide^blocks: In 
each case, 6 ^g of oligomer and 60 pC± of [if- P] ATP 
(>5000 Ci/mMol) were dried and redissolved in a final . 
buffer concentration of 50 mM Tris-HCl, pH 7.8, 10 mM 
25 MgCl 2 , 0.25 mM ATP, 10 mM B-mercaptoethanol and incubated 
with 4 units (1 unit is the amount that catalyses the 
production of 1 n mole of acid-insoluble P after 
incubation for 30 minutes at 37 °C according to Richardson, 
C.C.,(1972), Progress in Nucleic Acids Research, 2, 815) 
30 of T4 polynucleotide kinase (EC 2.7.1.78, Bethesda Research 
Labs) at 37 °C for 15 minutes. The enzyme was subsequently 
inactivated by a 5 minute incubation at 100 °C. 

Ligation of oligonucleotide blockss Except 
where indicated above, equimolar quantities (6 to 18 pq) 
35 of oligonucleotide blocks were incubated in 50 mM Tris- 
HCl, pH 7.8, 10 mM MgClj, 1 mM ATP, 20 mM dithiothreitol 
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(DTT) with 6 units (Limit is the amount that catalyses 

32 32 
the conversion of 1 n mole of PPi into (a/0 P)-ATP 

in 20 minutes at 37°C according to Weiss, B., et al , 

(1968), J. Biol, Chem., 243 , 4543) of T4 DNA ligase 

(EC 6,5. 1.1, Bethesda Research Labs) at 25 °C for from 

3 to 16 hours. Ligated DNA was precipitated by addition 

of 2.5 vol absolute ethanol, collected by centrifugation 

and redissolved in water. 

Purification of ligated species: Ligated 
were electrophoresed* on 20% (w/v) polyacryl amide 
oligonucleotide blocks/in S>0 mM Tris-HCl, pH 8.3, 90 mM 

boric acid, 2.5 mM EDTA (TBE buff er) , and the fragments 

located by autoradiography. Slices of gel containing 

the fragments were excised and the DNA electroeluted 

at 1 mA in TBE buffer onto 0.5 ml of DEAE cellulose 

(DE52, Whatman) for a few hours. After extensive 

washing by 0.1KNH 4 OAc, 2 mMMg(0Ac) 2 , 0.02% (w/v) SDS, 

0.02 mM EDTA (AGEB buffer), the DNA was eluted from 

the DEAE cellulose by 2 ml of 1.1 M NaCl in AGEB buffer 

and precipitated by addition of 2.5 vol. absolute 

ethanol . 

Construction of chimeric cloning vector pLF^l: 
In order to facilitate the two-stage cloning of 
urogastrone, it was desirable to construct a vector or 
vectors having Hin d III, Xba I and Bam HI cleavage 
sites, see Figure 9 of the accompanying drawings. 
However, no readily available E. coli plasmids possess 
Xba I sites. It was surprisingly noticed that the 
S- aureus plasmid pUB^llO, which cannot be propagated in 
E. coli (see, for example, Keggins, K.M., et al_ (1978), 
Proc. Natl. Acad. Sci. U.S.A., 75., 1423), contains a 
region of about 1 kbp-- of DNA bounded by sites for 
EcoRI and Bam HI having an approximately central Xba I 
site. Therefore, pUB^llO was cleaved with Eco RI (EC 
3.1.4.32) and Bam HI and the DNA fragments electrophoresed 
on 5% (w/v) polyacrylamide. The approximately 1 kbp 
Eco RI / Bam HI fragment was removed by electroelution from 
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the excised gel slice onto DEAE cellulose, eluted by 
1.1 M NaCl and ethanol precipitated. The E. coli plasmid 
pAT 153 (see, for example, Twigg, A.J., and Sherratt, 
D.^dSSO), Nature, 283. 216-218) was also cleaved with 
EcoRI and Bam HI, and the 3282 bp fragment purified by 
electroelution as above. These two purified DNA fragments 
were ligated in eguimolar quantities using 10 units of 
T4 DNA ligase in 50 mM Tris -HCl,.pH 7.8, 10 mtt MgCl 2 , 
1 mM ATP, 20 mM DTT at 15 °C for 18 hours. After ethanol 
precipitation, the ligated DNA was dissolved in 100 ;il 
10 mM Tris pH 7.5, 10 mM MgCl 2 , 10 mM CaCl 2 and 
transformed into E. coli K12 HB^lOl (genotype gal"", lac , 
ara", pro", erg", str r , rec A",r£, M k ; see, for example, 
Boyer, H.W., and Roullard-Dussoix, D., J. Mol. Biol., 
41, 459-472) using known methods (see, for example, Cohen, 
et al , (1972), Proc. Natl, Acad. Sci. U.S.A., 69, 2110- 
2114) and transformants resistant to 100 ug/ml ampicillin 
selected.. Several transformants were analysed further by 
restriction enzyme cleavage of plasmid DNA, and one full 
length clone, designated pLF^l, selected for further use 
as a cloning vector. At a later stage, an additional 
Hin d III restriction sequence was created at the EcoRI 
site. 

Cloning of the synthetic urogastrone gene in 
pLF lr The two portions of the assembled urogastrone 
gene -..-were cloned in two transformation stages, as illustrated 
in Figure 10 of the accompanying drawings. 

Right-hand portion: The longer fragment of 
Xba I, Bam Hl-cleaved pLF^l was purified by 
electroelution as above and ligated to a large excess of 
the right-hand portion assembly of the urogastrone gene 
using 10 units T4 DNA ligase in 50 mM Tris-HCl, pH 7.8, 
10 mM MgCl 2 , 1 mM ATP, 20 mM DTT at 15 °C for 18 hours. 
This was transformed into E. coli as before, with 
selection for 100 ug/ml ampicillin. Several transformants 
were selected for plasmid analysis by restriction enzymev 
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cleavage analysis and one clone, designated pUR 1, used 
for further cloning. The sequence of the inserted 
urogastrone gene DNA was confirmed by chemical degradation 
analysis (see, for example, Maxam f A., and Gilbert, W. , 
(1977), Proc. Natl. Acad* Sci* U.S.A., 74, 560-4) ♦ 

In order to have a Hin d III site for ligation 
to the 5 1 end of the left-hand portion of the gene, 
the Eco RI site was modified as follows: P^R^l wa s 
cleaved with EcoR I and the resulting recessed ends filled 
using 5 units (1 unit is the amount that incorporates 
10 n moles of total nucleotides into an acid-precipitable 
fraction in 30 minutes at 37 °C using poly-d(A-T) as 
primer according to Richardson, C.C., et al, (1964), 
J. Biol. Cheiru, 239 , 222) of DNA polymerase I (EC 2.7.7.7) 
in 50 mM Tris-HCl, pH 7.5, 10 mM MgCl 2 , 1 mM 8-mercapto- 
ethanol with 0.25 mM of each deoxynucleotide for 30 minutes 
at 15°C. 

To the resulting flush ends, were ligated a 
large excess of synthetic Hin d III linkers (Collaborative 
Research) by the known blunt end ligation procedure (see, 
for example, Ullrich, A., et al, (1977), Science, 196 , 
1313-1319) . The DNA was ligated using 6 units of T4 DNA 
ligase in 50 mM Tris-HCl, pH 7.8, 10 mM MgCl 2 , 1 mM ATP, 
20 mM DTT at 15 °C for 18 hours. 

After Hind III restriction cleavage and 
purification, the full length plasmid was religated and 
transformed into E. coli as above, with selection for 
100 /ig/ml ampicillin. A transformant having a Hin d III 
site, designated pUR 2, was selected for further cloning. 

Left-hand portion: pUR^ was cleaved with 
Hin d III and Xba I restriction enzymes and the longer 
fragment purified by electroelution as above. This was 
ligated to a two-fold molar excess of the assembled 
left-hand portion of the urogastrone gene using 6 units 
T4 DNA ligase in 50 mM Tris-HCl, pH 7*8, 10 mM MgCl 2 , 
1 mM ATP, 20 mM DTT for 18 hours at 15 °C. This was 
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transformed into E. coli K12 MRC 8 (genotype dap 103 
hsd R met Bl glm 533 upp 1 dap 101 sup E thy A 103 deo 
rec Al) with selection for 100 ;ig/ml ampicillin. Several 
transformants were selected for plasmid analysis by 
restriction enzyme cleavage. One clone, designated pURj: 
was used for further characterisation and cloning. The 
sequence of the full urogastrone gene was confirmed 
by chemical degradation analysis. It should be noted 
that pOR^I has no remaining S aureus DNA sequence 
present. 

The present invention also relates to a cell, 
in particular an E. coli cell, characterised in that it 
comprises inserted therein such a synthetic gene or a sub- 
unit thereof or such a plasmid recombinant. 

The present invention further relates to a 
process for the production of such a cell characterised 
in that it comprises inserting such a synthetic gene or 
a sub-unit thereof or such a plasmid recombinant into 
a cell. 

20 Expression of urogastrone in E. coli: The 

urogastrone gene insert was cleaved from pORj. by Hind 
III and Bam HI cleavage and purified by polyacryl amide 
gel electrophoresis and electroelution as above. This 
fragment was ligated to Hind III, Bam Hl-cleaved pWTJ.21 
25 and pWT 221, (see, for example, Tacon, W.C.A., .et al , 

(1980),~Molec. Gen. Genet. 177, 427) and the recombinant 
molecules used to transform E.coli MRC 8, (see, for 
example, Emtage, J.S., et al , (1980), Nature, 283, 171- 
174) , with selection for 100 jig/ml ampicillin. Trans- 
30 formants containing full length urogastrone genes were 

characterised by restriction enzyme cleavage analysis and 
DNA/purified by isopycnic centrifugation in caesium 
chloride . 

Expression of urogastrone-like fusion protein 
35 was induced by growth of cells in L-broth (luria broth: 

1% (w/v) bacto tryptone., 0.5% (w/v) bacto yeast extract, 
0.5% (w/v) NaCl, 0.2% (w/v) glucose, 0.004% (w/v) 
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thymine, pH 7) containing 100 ^g/ml ampicillin to an 
A600 nra of 0.3 . Following centrifugation, the cells 
were washed and resuspended in M9 medium lacking 
tryptophan, but containing 20 ^g/ml 3 p-indole acrylic 
acid. The cells were incubated at 37 °C for 4 hours. 
Under these conditions maximal tryptophan promoter 
activity is known to occur (see Tacon, et <U, loc cit) , 
and hence expression of the urogastrone fusion protein. 

The present invention also relates to a 
process for the production of urogastrone or a sub- 
unit thereof characterised in that it comprises*** \» 
culturing such a cell and recovering expressed protein. 

As mentioned above, urogastrone has an 
application in the treatment of ulcers, and also in other 
instances where the growth promoting activity thereof 
would be beneficial, for example, in wound healing. 
Conventional administration forms may be used, the active 
material being used in an effective amount, for example 
from 0.1 to 1.0 ^g/kg body weight, preferably from 0.125 
to 0.5 ^ig/kg body weight, more preferably about 0.25 
^ig/kg body weight, optimally together with a 
conventioanl pharmaceutically-acceptable carrier, diluent 
or adjuvant. 
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CLAIMS: 



1. a synthetic gene characterised in that it codes 
for the expression of urogastrone or a sub-unit thereof. 

2. a synthetic gene is claimed in claim 1 
characterised in that it comprises the following sequence 
or a sub-unit thereof : 

Coding: aattccgatagcgagtgtcctctg- 

NOH-COding: TTAAGGCTATCGCT.CACAGGAGAC- 
AGTCACGATGGTTACTGTCTACATGACGGC- 
TCAGTGCTACCAATGACAGATGTACTGCCG- 

GTCTGTATGTATATTGAGGCTCTAGACAAG- 
CAGACATACATATAACTCCGAGATCTGTTC- 

TACGCGTGTAATTGCGTTGTTGGCTACATC- 
ATGCGCACATTAACGCAACAACCGATGTAG- 

GGTGAGCGCTGTCAGTATCGAGATCTGAAA- 
CCACTCGCGACAGTCATAGCTCTAGACTTT- 

TGGTGGGAACTTAGA 
ACCACCCTTGAATCT 

3. A synthetic gene as claimed in claim 1 or claim 2 

characterised in that it comprises the following sequence 
or a sub-unit thereof: 

Coding: AGCTTAAAAAGAATTCCGATAGCGAGT 
Non-COding: ATTTTTCTTAAGGCTATCGCTCA 
.GTCCTCTGAGTCACGATGGTTACTGTCT- 
CAGGAGACTCAGTGCTACCAATGACAGA- 
A C A T G A C G G C G T C T G T A T G T A T A T T G A G - 
TGTACTGCCGCAGACATACATATAACTC- 
GCTCTA. GACAAGTACGCGTGTAATTGCG- 
CGAGATCTGTTCATGCGCACATTAACGC- 
TTGTTGGCTACATCGGTGAGCGCTGTCA- 
AACAACCGATGTAGCCACTCGCGACAGT- 
GTATCGAGATCTGAAATGGTGGGAACTT- 
CATAGCTCTAGACTTTACCACCCTTGAA- 
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A G A T A A G 
TCTATTCCTAG 

4. A process for the production of a synthetic gene 
or a sub-unit thereof as claimed in any of claims 1 to 3 
characterised in that it comprises the assembly and 

7. ligation of a number of oligonucleotide blocks. 

5. A plasmid. recombinant characterised in that it 
comprises a plasmid vector having inserted therein at an 
appropriate insertion site a synthetic gene or a sub-unit 
thereof as claimed in any of claims 1 to 3, the plasmid 
recombinant enabling translation in the correct phase for 
the mRNA corresponding to the inserted gene or sub-unit 
thereof and having a bacterial promoter upstream of and 
adjacent to the insertion site such that the inserted gene 
or sub-unit thereof is under bacterial promoter control. 

6. A plasmid recombinant as claimed in claim 5 
characterised in that it comprises as the plasmid vector 
pLFl. 

7. A process for the production of a plasmid 
recombinant as claimed in claim 5 characterised in that it 
comprises inserting a synthetic gene or a sub-unit thereof 
as claimed in any of claims 1 to 3 into an appropriate 
insertion site of an appropriate plasmid vector. 

8. A cell characterised in that it comprises 
inserted therein a. synthetic gene or a sub-unit thereof 

as claimed in any of claims 1 to 3 or a plasmid recombinant 
as claimed in claim 5 or claim 6. 

9. A process for the production of a cell as claimed 
in claim 8 characterised in that it comprises inserting a 
synthetic gene or a sub-unit thereof as claimed in any of 
claims 1 to 3 or a plasmid recombinant as claimed in claim 

5 or claim 6 into a cell. 




0046039 



a process for the production of urogastrone or 
a sub-unit thereof characterised in that it comprises 
culturing a cell as claimed in claim 8 and recovering 
expressed protein. 
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-10 -5 
Met Gin Thr Gin Lys Pro Thr Pro Ser Ser Lvs 
coding ATGCAAACACAAAAACCGACTCCAAGCTCCAAG 



Non-coding T T C 



1 5 
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu 
CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG 



GAATTTTTCTTAAGGCTATCGCTCACAGGAGAC 



10 15 
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 
AGTCACGATGGTTACTGTCTACATGACGGCGTC 



TCAGTGCTACCAATGACAGATGTACTG C C GC AG 



20 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATTGAGGCTCTAGACAAGTACGCG 



ACATACATATAACTCCGAGATCTGTTCATGCGC 



Fig .1 (Part lof 2) 
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35 40 
Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg 
tgtaatt6cgttgt T GGC TaCATCGGTGAGCGC 



ACATTAACGCAACAACCGATGTAGCCACTCGCG 



Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu 
TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 



ACAGTCATAGCTCTAGACTTTACCACCCTTGAA 



55 

Arg Ter Gly Ser 
AGATAAGGATCC coding 



TCTATTCCTAGG Non-coding 



Fig.1 (Part 2 of 2) 
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Hind II I 

-10 -5 AluI 

Met Gin Thr Gin Lys Pro Thr Pro Ser Ser Lys 

ATGCAAACACAAAAACCGACTCCAAGCTCC A A G 

• i • # » 

TT C 



EcoRI _ Mnll 

I 

? A A T T T T T C T T A A GGCTATCGCTCACAGGAGAC 
I * 



EcoRI Mnll 

Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu 
CTTAAAAAGAATTCCGATAGCGAGTGTGCTCTG 



Hinfl ^ AccI Hgal 

A 5 ^ T Asp Gly Tyr Cvs Leu His Asp Gly Val 
AGT.CACGATGGTTACTG T*C T A C A T G AC G G C G T C 



T? A 9 T6CTA C CAAT GACAGATGT*ACTGCCGCAG 

«•••«• Hgal 



Mn1I • Xbal FnuDII 
20 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATTGAGGCTCTAGACAAGTACGCG 



ACATACATATAACTCCGAGATCTGTTCATGCGC 

Hhal 



Fig. 2 (Part 1of 2) 
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HphI Haell 
35 AO 
Cys Asn Cys Val Val Gly Tyr He GW Glu Arg 
T 6 T A A T T G CGTTGTTGbC T A CATCGGTGAGC G C 

•* I 

ACATTAACGCAACAACCGATGTAGCCACTCGCG 

+++++♦♦-»+ »!•«••• 

«i 

Mbol 
TaqI Bglll 

45 50 
Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu 
T G T OA GTATCGAGATCTGAAATGGTGGGAACTT 



A C A+GTCAT AGCTC TAGACTT T ACC ACCC TTGAA 
HphI I* •••"•«! 





Bam HI 
Mbol 

Arg Ter Gly Ser 
AGAT AAGGATCC 

1 ***•••* 

T CTAT T CCTAGG 

•* **r 



Fig. 2 (Part 2 of 2) 
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coding A G 
Non-coding — 



1 5 

Leu Lys Lys Ash Ser Asp Ser Glu Cys Pro Leu 

CTTAAAAaGAATTCCG AT AGCGAGTGTCC TCTG 
, j 

I 03) HI (18) 



—• ATTTT TCTT A AG G C T A T C G C T C AC A G G A G A C 

I 1 

11(15) IV (18) 



10 15 
Ser His Asp Gly Tyr Cys Leu His Asd GIv Val 
A G T C A C G A T G G» T T AC T G T C T A CAT GA P C 6% 6 T C 

, — i- 

V (18) VII (18) 

TCAGTGCTACCAATGACAGATGTACTGCCGCAG 

— , ! 

VI (18) VIII (18) 



20 25 30 

Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala 

TGTATGTATATTGAGGCTCTAGACAAGTACGCG 
! - _! 

IX (12) XI (12) I- 

XIII (18) 

ACATACjATATAACTCCGAGATCTGTTCATGCGC 

X (18) I-r 

XII (20) 



Fig. 3 (Partlof 2) 
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35 40 
Cys Asn Cys Val Val Gty Tyr He Gly Glu Arg 
TGTAATT GCGTTG TTGGCTACATCGGTGAGCGC 

XV (16) " I X VIl" (13) 1 " 



ACATTAACGCAACAACCGATGTAGCCACTCGCG 
1 1 

XII (20) XIV (16) XVI (12) 



45 50 
Cys Gin Tyr Arg Asp Leu • Lys Trp Trp Glu Leu 

TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 

, 1 

XIX (15) XXI (12) XXIII (14) 

ACAGTCATAGCTCTAGACTTTACCACCCTTGAA 

! ! 1- 

XVIII (14) XX (14) 



Arg Ter 

AG ATA AG Coding 



XXIII (14) 



liJAUfiH? Non-coding 
XXII (12) 



Fjg.3 (Part2of2) 
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coding A G 
I— 

Non -coding — 



1 5 

Leu Lvs Lys Asn Ser Asp Ser Glu Cys Pro Leu 

CTTAAAAAGAATTCCG at agcgagt(?tcctc T G 
, _! 

I (13) III (18) 



-atttttcttaaggctatcgctcacaggagac 

! j 

11(15) IV (18) 



10 15 
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 
AGTCACGATGGTTACTGTCTACATGACGGTGTC 
1 „_!_ 

V (18) VII*(18) 

TCAGTGCT AC CAATGACAGATGTAC TGCCACAG 

— I— 1 - 

VI (18) VIII*(13) 



20 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATCGAAGCTCTAGACAAGTACGCG 

IX (12) XI*(12) I 

XIII (18) 

ACATACATATAGCTTCGAGATCTGTTCATGCGC 
, 1 

X *(18) I 

XII (20) 



Fjg .4 (Part lof 2) 
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35 40 
Cys Asn Cys Vat Val Gly Tyr He Gly Gtu Arg 
TGTAATTGCGTTGTTGGCTACATCGGTGAGCGC 

XV (16) 1 XV1M13) I_ 



ACATTAACGCAACAACCGATGTAGCCACTCGCG 
1 , 

XII (20) XIV (16) XVI (12) 



45 50 
T C X S T ^ G i" _Jy r _ Arg Asp Leu Lys Trp Trp Glu Leu 
TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 

1 I 

XIX (15) XXI (12) XXIII (1A) 

ACAGTCATAGCTCTAGACTTTACCACCCTTGAA 
XVIII (14) 1 ~~"~xx7tt)~ _I " 



Arg Ter 

AG A T A A G coding 
XXIII (14) 



L?.IALLi51?.?"Non-codinfl 



XXII (12) 



Fjg . 4 (Part 2of 2) 
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Fig.8 (Part 1 of 2) 
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Right-hand portion 
13 15 17 



19 



21 



23 




16 



18 



19 



16 v 18 




20 



22 



23 . 22 « 20 

20 " 22 * 23 
(dimer) 



14 . 12 . 13 . 15 .17 



19 . 21 . 23 . 22 . 20 . 18 . 16 
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f * a 12 » 13 
L 17 * 15 * 13 *~ 
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Fig.8 ( Part 2 of 2 ) 
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Description 

This invention relates to a synthetic orogastrone gene, to corresponding plasmid recombinants and 
transformed cells, to the production thereof and to urogastrone expression. 

5 Urogastrone is a polypeptide hormone (protein) synthesised in the duodenum and in the salivary 
glands of normal humans, (see, for example, Heitz, et al, (1978), Gut, 19, 408—413). Urogastrone 
suppresses the secretion of gastric acid and promotes cell growth, (see, for example, Elder, era/, (1975), 
Gut, 16, 887 — 893). Therefore, it has an application in the treatment of ulcers and in the promotion of 
wound healing. Urogastrone is excreted in small amounts in human urine and may be isolated therefrom. 

io There exists, however, a need for a more viable commercial production thereof and such is provided 
according to the present invention. 

Urogastrone is known to consist of 53 amino acids in the following sequence: 

Asn Ser Asp Ser G!u Cys Pro Leu Ser His Asp- 
Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr- 
15 He Glu Ala Leu Asp Lys Tyr Aia Cys Asn Cys- 

Val Val Gly Tyr Me Gly Gfu Arg Cys Gin Tyr- 
Arg Asp Leu Lys Trp Trp Glu Leu Arg 

(see, for example, Gregory, H., and Preston, B. M., (1977), Int. J. Peptide Protein Res., 9, 107—118). 

From the above amino acid sequence, a corresponding synthetic gene sequence has been invented, 
subject to a number of specific non-obvious criteria, and oligonucleotide blocks synthesised which, when 
assembled, form a synthetic gene coding for urogastrone. The blocks have been hybridised and ligated in 
pre-determined stages to construct the urogastrone gene in two portions. These have been cloned in two 
operations into a new specifically-designed chimeric £ coli/S. aureus vector so as to produce a full length 
urogastrone,gene flanked only by £ coli plasmid DNA. The gene has been excised from this recombinant 
and re-cloned into vectors specifically designed to maximise expression of the gene in £ coli, under the 
control of the promoter obtained from the £ coli tryptophan operon. A protein resembling human 
urogastrone has thus been expressed in £ coli. 

From the above amino acid sequence, because of the degeneracy of the genetic code, it is possible to 
predict numerous necleotide sequences which would code for the protein. 

In the inventive determination of an optimum sequence from the large number of possibilities, several 
non-obvious criteria have been observed. Firstly, trinucleotide codons should be used which are acceptable 
or preferable in the cells to be used, in particular £ colL Secondly, it was decided that it was desirable to 
have different restriction enzyme recognition sites at the termini of the molecule so as to allow insertion 
into a plasmid in a desired orientation. Moreover, it was decided to select sites which allowed the use of 
well-understood cloning vectors, such as pBR322 (see, for example, Bolivar, F„ et al, (1977), Gene, 2, 
95— 1 13). In fact. Hind III and Bam HI sites were selected and introduced at the 5' and 3' ends, respectively. 
Thirdly, it was thought desirable to introduce a series of restriction endonuclease recognition sites 
strategically placed along the molecule to enable the gene to be specifically disected to aid characterisation 
and, possibly mutagenesis. Also, this measure allowed the two portions of the molecule to be cloned in 
stages. In particular, an Xba I site was introduced at a central location in the gene. Fourthly, the synthesis 
should not be unnecessarily complicated and illegitimate cross-hybridisations should be minimised in 
order to facilitate gene assembly. Using a computer, energies of interactions arising from all possible 
approximations were* calculated (see, for example, Tinoco, Jr., I., et al, (1971), Nature, 230, 362—367; 
Powers, G. J. ef al, (1975), JACS, 97, 875(889), so that stable off-diagonal interactions migfcit be avoided 
where possible. Fifthly, since the protein ultimately expressed in bacterial cells will be in the form of a 
fusion product, it was desirable to have a means of cleaving the urogastrone portion from such a fusion 
product Since urogastrone is known to be insensitive to trypsin (see, for example, Gregory and Preston, 
foe tit), the codons specifying the dipeptide lys-iys were introduced near the end of the gene corresponding 
to the urogastron N-terminus in order to serve as a substrate for tryptic proteolysis. 

The present invention relates to a synthetic gene characterised in that it codes for the expression of 
urogastrone or an equivalent thereof or a sub-unit thereof in a bacterial cell and comprises the following 
. sequence or sub-unit thereof: 

5'AATTCCGATAGCGAGTGTCCTCTG- 
55 3'TTAAGG CTATCGCTCACAGGAGAC- 

AGTCACGATG GTTACTGTCTACATGACG G- 
TCAGTGCTACCAATGACAGATGTACTGCC- 
CGTCTGTATGTATATTGAGGCTCTAGACA- 
GCAGACATACATATAACTCCGAGATCTGT- 
60 AGTACGCGTGTAATTGCGTTGTTGGCTAC- 
TCATGCG CACATTAACGCAACAACCGATG- 
ATCGGTGAGCGCTGTCAGTATCGAGATCT- 
TAGCCACTCG CGACAGTCATAGCTCTAGA- 
GAAATGGTGGGAACTTAGA 
65 CTTTACCACCCTTG AATCT 
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In fact, bearing in mind, /infer alia, the above-mentioned considerations, for convenience, a slightly 
longer sequence was selected which is as follows: 

Coding 5'AGCTTAAAAAGAATTCCGATAGCGAGT- 
5 Non-Coding 3'ATTTTTCTTAAG G CTATCG CTCA- 

GTCCTCTGAGTCACGATGGTTACTGTCT- 
CAG GAGACTCAGTGCTACCAATGACAGA- 
ACATGACGGCGTCTGTATGTATATTGAG- 
TGTACTGCCGCAGACATACATATAACTC- 
to GCTCTAGACAAGTACGCGTGTAATTGCG- 
CGAGATCTGTTCATGCGCACATTAACGC- 
TTGTTGGCTACATCGGTGAGCGCTGTCA- 
AACAACCGATGTAGCCACTCGCGACAGT- ' 
GTATCGAGATCTGAAATGGTGGGAACTT- 
15 CATAGCTCTAGACTTTACCACCCTTGAA- 
A G A T A A G 
TCTATTCCTAG 

The whole of this expanded sequence is shown in Figure 1 of the accompanying drawings and the 
20 restriction enzyme recognition sites are shown in Figure 2 of the accompanying drawings. 

The present invention also relates to a process for the production of the above said synthetic genes or a 
sub-units thereof characterised in that it comprises the assembly and ligation of a number of 
oligonucleotide blocks. 

Such "blocks" should not be regarded as sub-units of the gene. In the present context, "sub-unit" 
25 refers to a sequence which is less than the whole, but which exhibits the desired properties. 

It was in fact decided to synthesise a molecule having the above expanded sequence by making 23 
synthetic oligonucleotide blocks as illustrated in Figure 3 of the accompanying drawings, which will 
assemble by single-strand overlaps to give the complete double-stranded nucleotide sequence. 

In order to minimise strong illegitimate interactions near the centre of the molecule during assembly, 
30 certain blocks of modified sequence have also been synthesised, as illustrated in Figure 4 of the 
accompanying drawings. 

As mentioned above, the synthetic blocks selected are shown in Figure 3 of the accompanying 
drawings. The blocks may be constructed using known synthesis techniques (see, for example, Agarwal, et 
al, (1970), Nature, 227, 27—34; and Crea, et af, (1978), Proc. Natl. Acad. Sci. U.S.A., 75, 5765—5769). 
35 The synthetic methods will now be illustrated with reference to the synthesis of the 
tetradecanucleotide ApGpTpTpCpCpCpApCpCpApTpTpT. 

The methods of building up oligonucleotides from smaller units by successive coupling reactions are 
well known (see, for example, Hsiung, ef al, (1979), Nucleic Acid Research, 6, 1371—1385). The completely 
protected tetradecanucleotide was built up as shown in Figure 5 of the accompanying drawings (wherein, 
40 for convenience, protecting groups are not shown). 

The condensation reactions indicated by arrows ?n Figure 5 were carried out by the following 
procedure exemplified in the synthesis of: 

DMTr A Br p G ,Bu p-CNEt 

45 : : 

<j)CI <|>CI 

as shown in Figure 6 of the accompanying drawings. From 1.1 to 1.5 mmole of the 3'-phosphodiester 
component (I) was condensed with 1.0 mmole of the 5'-hydroxyl component (II) in anhydrous pyridine in 

so the presence of from 3 to 4.5 mmoles of 2,4,6-triisopropyl-benzene-sulphonyl-tetrazolide. The reaction was 
left for 1 hour at room temperature or until chromatography on silica TLC plates eluted with 10% (v/v) 
methanol in chloroform showed that the hydroxy! component was exhausted. The reaction was quenched 
with 5% (w/v) sodium bicarbonate solution and extracted with chloroform. The chloroform extract was 
dried and loaded onto a reverse phase chromatography column (ODS bonded to 15 — 25 micron silica). The 

55 fully protected dinucleotide product (III) was eluted with a solvent gradient from 
chioroform:methanol:water (2:6:3 v/v) to chloroform: methanol: water (2:6:0.5 v/v). The product (III) was 
extracted into chloroform and dried. The final isolated yield was 81%. 

To proceed to further condensations, the terminal protecting group (DMTr or CNEt) was removed 
selectively using triethylamine in pyridine (CNEt) or a 2% (w/v) solution of benzene sulphonic acid in 

so chloroform: methanol (DMTr) as shown in Figure 7 of the accompanying drawings. 

At the completion of the synthesis, all of the protecting groups were removed by sequential treatment 
with 0.1 M tetraethylammonium fluoride in THF/pyridine/water (8:1:1 v/v), ammonia and 80% acetic acid. 
The deprotected oligonucleotides were purified by ion-exchange HPLC and sequence analysis was carried 
out by the method of Wu, et al, (1976), Anal. Biochem., 74, 73—93. 

65 The oligomeric blocks of nucleotides were hybridised and ligated (see, for example, Agarwal, et af, loc 
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cit) in a series of steps, in order to minimise the possibilities for undesirable interactions, leading to the 
formation of the two portions as shown in Figure 8 of the accompanying drawings. The order of the 
additions in the assembly scheme was optimised for minimal incorrect ligations and in the case of 
especially difficult oligomeric blocks, notably 7 and 8, sub-molar quantities were used in order to remove 
5 all monomeric units before further additions were made. 
In more detail: 

Left-hand portion: Blocks 1 and 2 were ligated to form a dimer about the Hind III site. Blocks 4, 5 and 6 
and 8, 9 and 10 were also ligated in the first round of the assembly scheme. Molar equivalents were used 
for alt but block 8 where 0.75 molar equivalents were employed. 
jo Block 3 was ligated with 1 +2, block 7 (0.75 molar equivalent) with 4 to 6 and block 1 1 with 8 to 10. The 
8+9+10+11 assembly has one flush end, hence some blunt-end dimerisation was observed. 

1 to 3 and 4 to 7 were ligated and finally 8 to 11 were ligated to the resulting 1 to 7 species. 

The dimeric 1 to 11 left-hand portion was then cleaved by Hind 111 {EC 3.1.23.21) and Xba I (EC 3.1.23.4) 
to generate the monomeric left-hand portion, with the correct cohesive termini to allow construction of 
is recombinant plasmids. 

Right-hand portion: Blocks 12 and 13 were ligated to form a dimer about the Xba I sfte and blocks 20, 23 
and 22 similarly ligated to form a dimer about the Bam Hi site. Blocks 14, 15 and 17, and 16, 18 and 19 were 
also ligated at this stage. 

The 12, 13 dimer and the 14, 15, 17 assembly were ligated, as were the 16, 18, 19 asesmbly with the 20, 
20 22, 23 dimer, where block 21 was used as a joining section. 

These two species were then ligated to give an oligomeric molecule which was cleaved by Xba I and 
Bam Hi (EC 3.1.23.6) to give the monomer 12 to 23 species having the correct cohesive termini to allow 
construction of recombinant plasmids. 

The present invention further relates to a plasmid recombinant characterised in that it comprises a 
25 plasmid vector having inserted therein at an appropriate insertion site a synthetic gene or a sub-unit 
thereof as defined above, the plasmid recombinant enabling translation in the correct phase for the mRNA 
corresponding to the inserted gene or sub-unit thereof and having a bacterial promoter upstream of and 
adjacent to the insertion site such that the inserted gene or sub-unit thereof is under bacterial promoter 
control. 

30 A particular vector, designated pLFl, has been specifically designed to afford advantages for the 
purposes of the present invention. The inventive plasmid pLFl, is a 5K bp plasmid which may be 
propagated in E coli and which may be constructed from pBR322 and pUB1 10 {see, for example, Gryczan, 
T. J. r etal, (1978), J. Bacterial., 134, 318) by inserting the DNA sequence of the S. aureus plasmid pUB110 
between the EcoRI and Bam HI sites thereof {comprising approximately 870 bp) between the fcoRI and 

35 Bam HI sites of pBR322, thereby replacing that region of pBR322. 

The present invention further relates to a process for the production of such a plasmid recombinant 
characterised in that it comprises inserting such a synthetic gene or a sub-unit thereof as defined above 
into an appropriate insertion site of an appropriate plasmid vector. 
The following illustrates the present invention: 

40 Phosphorylation of oligonucleotide blocks: In each case, 6 ug of oligomer and 60 uCi of [y- 32 P] ATP 
(>5000 Ci/mMol) were dried and redissolved in a final buffer concentration of 50 mM Tris-HCI, pH 7.8, 10 
mM MgCJ 2 , 0.25 mM ATP, 10 mM p-mercaptoethanol and incubated with 4 units {1 unit is the amount that 
catalyses the production of 1 n mole of acid-insoluble 32 P after incubation for 30 minutes at 37°C according 
to Richardson, C.C., (1972), Progress in Nucleic Acids Research, 2, 815) of T4 polynucleotide kinase {EC 

45 2.7.1.78, Bethesda Research Labs) at 37°C for 15 minutes. The enzyme was subsequently inactivated by a 5 
minute incubation at 100°C. 

Ligation of oligonucleotide blocks: Except where indicated above, equimolar quantities (6 to 18 ug) of 
oligonucleotide blocks were incubated in 50 mM Tris-HCI, pH 7.8, 10 mM MgCI 2/ 1 mM ATP, 20 mM 
dithiothreitol (DTT) with 6 units {1 unit is the amount that catalyses the conversion of 1 n mole of ^PPt into 

so (a/0 32 P)-ATP in 20 minutes at 37°C according to Weiss, B., etal, (1968), J. Biol. Chem., 243 4543) of T4 DNA 
ligase (EC 6.5.1 .1, Bethesda Research Labs) at 25°C for from 3 to 16 hours. Ligated DNA was precipitated by 
addition of 2.5 vol absolute ethanol, collected by centrifugation and redissolved in water. 

Purification of ligated species: Ligated oligonucleotide blocks were electrophoresed on 20% {wM 
polyacrylamide in 90 mM Tris-HCI, pH 8.3, 90 mM boric acid, 2.5 mM EDTA (TBE buffer), and the fragments 

55 located by autoradiography. Slices of gel containing the fragments were excised and the DNA electroeluted 
at 1 mA in TBE buffer onto 0.5 ml of DEAE cellulose (DE52, Whatman) for a few hours. After extensive 
washing by 0.1MNH 4 OAc, 2 mM MgfOAcJz, 0.02% (w/v) SDS, 0.02 mM EDTA {AGEB buffer), the DNA was 
eluted from the DEAE cellulose by 2 ml of 1.1 M NaCI in AGEB buffer and precipitated by addition of 2.5 vol 
absolute ethanol. 

so Construction of chimeric cloning vector pLF1 : In order to facilitate the two-stage cloning of 
urogastrone, it was desirable to construct a vector or vectors having Hind III, Xba \ and Bam HI cleavage 
sites, see Figure 9 of the accompanying drawings. However, no readily available £1 coli plasmids possess 
Xba I sites. It was surprisingly noticed that the S. aureus plasmid pUB110, which cannot be propagated in £ 
coli (see, for example, Keggins, K. M. et al (1978), Proc. Natl. Acad. Sci. -U.S.A., 75, 1423), contains a region 

65 of about 1kpb of DNA bounded by sites for fcoRI and Bam HI having an approximately central Xba I site. 
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Therefore, pUB110 was cleaved with EcoRI (EC 3.1.4.32) and Bam HI and the DNA fragments 
electrophoresed on 5% (w/v) polyacrylamide. The approximately 1kbp EcoRVBam HI fragment was 
removed by efectroelution from the excised gel slice onto DEAE cellulose, eluted by 1.1 M NaCI and ethanol 
precipitated. The E coti plasmid pAT153 {see, for example, Twigg, A. J., and Sherratt, D., (1980), Nature, 

5 283, 216—218) was also cleaved with EcoRI and Bam HI, and the 3282 bp fragment purified by 
electroelution as above. These two purified DNA fragments were ligated in equimolar quantities using 10 
units of T4 DNA ligase in 50 mM Tris -HCI, pH 7.8, 10 mM MgCI 2 , 1 mM ATP, 20 mM DTTat 15° for 18 hours. 
After ethanol precipitation, the ligated DNA was dissolved in 100 uj 10 mM Tris pH 7.5, 10 mM MgCI 2 , 10 
mM CaCI 2 and transformed into E coti K12 HB1 01 (genotype gal", lac", ara", pro~, arg~, str*, rec A", r£, M^ ; 

to see, for example, Boyer, H. W., and Roullard-Dussoix, D., J. Mol. Biol., 41, 459 — 472) using known methods 
(see, for example, Cohen, et af, (1972), Proc. Natl. Acad. Sci. U.S.A., 69, 2110—2114) and transformants 
resistant to 100 ug/ml ampicillin selected. Several transformants were analysed further by restriction 
enzyme cleavage of plasmid DNA, and one full length clone, designated pLF1, selected for further use as a 
cloning vector. At a later stage, an additional Hind III restriction sequence was created at the EcoRI site. 

/5 Cloning of the synthetic urogastrone gene in pLF1 : The two portions of the assembled urogastrone 
gene were cloned in two transformation stages, as illustrated in Figure 10 of the accompanying drawings. 

Right-hand portion: The longer fragment of Kba I, Bam Hl-cleaved pLF1 was purified by electroelution 
as above and ligated to a large excess of the right-hand portion assembly of the urogastrone gene using 10 
units T4 DNA ligase in 50 mM Tris-HCt, pH 7.8, 10 mM MgCI 2 , 1 mM ATP, 20 mM DTT at 15°C for 18 hours. 

20 This was transformed into E coll as before, with selection for 100 pg/ml ampicillin. Several transformants 
were selected for plasmid analysis by restriction enzyme cleavage analysis and one clone, designated 
pUR1, used for further cloning. The sequence of the inserted urogastrone gene DNA was confirmed by 
chemical degradation analysis (see, for example, Maxam, A., and Gilbert, W., (1977), Proc. Natl. Acad. Sci. 
U.S.A., 74, 560-4). 

25 In order to have a Hind III site for ligation to the 5' end of the left-hand portion of the gene, the EcoRI 
site was modified as follows: pUR1 was cleaved with EcoRI and the resulting recessed ends filled using 5 
units (1 unit is the amount that incorporates 10 n moles of total nucleotides into an acid-precipitable 
fraction in 30 minutes at 37°C using poly-d(A-T) as primer according to Richardson, C.C., et a/, (1964), J. 
Biol. Chern., 239, 222) of DNA polymerase I (EC 2.7.7.7) in 50 mM Tris-HCI, pH 7.5, 10 mM MgCI 2 , 1 mM 

30 p-mercaptoethanol with 0.25 mM of each deoxynucleotide for 30 minutes at 15°C. 

To the resulting flush ends, were ligated a large excess of synthetic Hind 111 linkers (Collaborative 
Research) by the known blunt end ligation procedure (see, for example, Ullrich, A., et af, (1977), Science, 
796, 1313—1319). The DNA was ligated using 6 units of T4 DNA ligase in 50 mM Tris-HCI, pH 7.8, 10 mM 
MgCI 2 , 1 mM ATp, 20 mM DTT at 15°C for 18 hours. 

3s After Hind III restriction cleavage and purification, the full length plasmid was religated and 
transformed into E coti as above, with selection for 100 ug/ml ampicillin. A transformant having a Hind III 
site, designated pUR2, was selected for further cloning. 

Left-hand portion: pUR2 was cleaved with Hind III and Xba\ restriction enzymes and the longer 
fragment purified by electroelution as above. This was ligated to a two-fold molar excess of the assembled 

40 left-hand portion of the urogastrone gene using 6 units T4 DNA ligase in 50 mM Tris-HCI, pH 7.8, 10 mM 
MgCI 2 , 1 mM ATP, 20 mM DTT for 18 hours at 15°C. This was transformed into E coli K12 MRC 8 (genotype 
dap 103 hsd R met Bl glm 533 upp 1 dap 101 sup E thy A 103 deo rec A1) with selection for 100 ug/ml 
amicillin. Several transformants were selected for plasmid analysis by restriction enzyme cleavage. One 
clone, designated pUR1 was used for further characterisation and cloning. The sequence of the full 

45 urogastrone gene was confirmed by chemical degradation analysis. It should be noted that pUR1 has no 
remaining S aureus DNA sequence present. 

The present invention also relates to a bacterial celt, in particular an E coli cell, characterised in that it 
comprises inserted therein a plasmid recombinant as defined above. 

The present invention further relates to a process for the production of such a bacterial cell 

so characterised in that it comprises inserting a plasmid recombinant as defined above into a bacterial cell. 

Expression of urogastrone in E coli: The urogastrone gene insert was cleaved from pUR1 by Hind III 
and Bam HI cleavage and purified by polyacrylamide gel electrophoresis and electroelution as above. This 
fragment was ligated to Hind III, Bam Hl-cleaved pWT121 and pWT221, (see, for example, Tacon, W.C.A., et 
at, (1980), Molec. Gen. Genet 177, 427) and the recombinant molecules used to transform E coli MRC 8, 
55 (see for example, Emtage, J. S., etal, (1980), Nature, 283, 171 — 174), with selection for 100 ug/ml ampicillin. 
Transformants containing full length urogastrone genes were characterised by restriction enzyme cleavage 
analysis and DNA was purified by isopycnic centrifugation in caesium chloride. 

Expression of urogastrone-like fusion protein was induced by growth of cells in L-broth (luria broth: 
60 1% (w/v) bacto tryptone., 0.5% (w/v) bacto yeast extract, 0.5% (w/v) NaCI, 0.2% (w/v) glucose, 0.004% (w/v) 
thymine, pH 7) containing 100 ug/ml ampicillin to an A600 nm of 0.3. Following centrifugation, the cells 
were washed and resuspended in M9 medium lacking tryptophan, but containing 20 *igml 3 3-indole acrylic 
acid. The cells were incubated at 37°C for 4 hours. Under these conditions maximal tryptophan promoter 
activity is known to occur (see Tacon, etal, toe cit), and hence expression of the urogastrone fusion protein. 
55 The present invention also relates to a process for the production of urogastrone or an equivalent 
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thereof or a sub-unit thereof characterised in that it comprises culturing a bacterial cell as defined above 

and recovering expressed protein. 

The present invention further relates to a fused polypeptide characterised in that it comprises 

urogastrone or an equivalent thereof or a sub-unit thereof covalently bonded with all or part of a gene of 
5 the tryptophan operon, e.g. trp E, and in that it is obtained by culturing a bacterial cell as defined above and 

recovering expressed protein. 

As mentioned above, urogastrone has an application in the treatment of ulcers, and also in other 

instances where the growth promoting activity thereof would be beneficial, for example, in wound healing. 

Conventional administration forms may be used, the active material being used in an effective amount, for 
w example from 0.1 to 1,0 u.g/kg body" weight, preferably from 0.125 to 0.5 pg/kg body weight more 

preferably about 0.25 ug/kg body weight, optimally together with a conventional pharmaceutically- 

acceptable carrier, diluent or adjuvant 

Claims 

15 

1. A synthetic gene characterised in that it codes for the expression of urogastrone or an equivalent 
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a sub-unit thereof: 

Coding: 5'AATTCCGATAGCGAGTGTCCTCT- 
20 Non-coding: 3' TTAAGGCTATCGCTCACAGG AG A- 

GAGTCACGATG GTTACTGTCTACATGACGG- 
CTCAGTGCTACCAATGACAGATGTACTGCC- 
CGTCTGTATGTATATTGAGGCTCTAGACAA- 
GCAGACATACATATAACTCCGAGATCTGTT- 
25 GTACGCGTGTAATTGCGTTGTTGGCTACAT- 
CATGCGCACATT. AACGCAACAACCGATGTA- 
CGGTGAGCGCTGTCAGTATCGAGATCTGAA- 
GCCACTCGCGACAGTCATAGCTCTAGACTT- 
ATGGTGGGAACTTAGA 
30 TACCACCCTTGAATCT 

2. A synthetic gene as claimed in Claim 1 characterised in that it comprises the following sequence or a 
sub-unit thereof: 

35 Coding: 5' AG CTTAAAAAG AATTCCG ATAG- 

Non-coding: 3' ATTTTTCTTA AG G CTATC- 
CGAGTGTCCTCTGAGTCACGATG GTTACT- 
GCTCACAGGAGACTCAGTGCTA. CCAATGA- 
GTCTACATGACGGCGTCTGTATGTATATT- 

40 C A G A TGTACTGCCGCAGACATACATATAA- 

GAGGCTCTAGACAAGTACGCGTGTAATTG- 
CTCCGAGATCTGTTCATGCGCACATTAAC- 
CGTTGTTGGCTACATCGGTGAGCGCTGTC- 
GCAACAACCGATGTAGCCACTCGCGACAG- 

45 AGTATCGAGATCTGAAATGGTGGGAACTT- 
TCATAGCTCTAGACTTTACCACCCTTGAA- 
A G A T A A G 
TCTATTCCTAG 

so 3. A process for the production of a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 
2 characterised in that it comprises the assembly and ligation of a number of oligoncleotide blocks. 

4. A plasmid recombinant characterised in that it comprises a plasmid vector having inserted therein at 
an appropriate insertion site a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2, the 
plasmid recombinant enabling translation in the correct phase for the mRNA corresponding to the inserted 

55 gene or sub-unit thereof and having a bacterial promoter upstream of and adjacent to the insertion site 
such that the inserted gene or sub-unit thereof is under bacterial promoter control. 

5. A plasmid recombinant as claimed in Claim 4 characterised in that it comprises as the plasmid vector 
pLFI. 

6. A process for the production of a plasmid recombinant as claimed in Claim 4 characterised in that it 
60 comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 into an 

appropriate insertion site of an appropriate plasmid vector. 

7. A bacterial cell characterised in that it comprises inserted therein a plasmid recombinant as claimed 
in Claim 4 or Claim 5. 

8. A process for the production of a bacterial cell as claimed in Claim 7 characterised in that it 
65 comprises inserting a plasmid recombinant as claimed in Claim 4 or Claim 5 into a bacterial cell. 
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9. A process for the production of urogastrone or an equivalent thereof or a sub-unit thereof 
characterised in that it comprises culturing a bacterial cell as claimed in Claim 7 and recovering expressed 
protein. 

10. A fused polypeptide characterised in that it comprises urogastrone or an equivalent thereof or a 
5 sub-unit thereof covalently bonded with all or part of a gene of the tryptophan operon and in that it is 

obtained by culturing a bacterial cell as claimed in Claim 7 and recovering expressed protein. 

1 1 . A fused polypeptide as claimed in Claim 1 0 wherein the gene of the tryptophan operon is the trp E 
gene. 

to Patentansp ruche 

1. Synthetisches Gen, dadurch gekennzeichnet, daS es die Expression fur Urogastron oder ein 
Aquivalent desselben oder eine Untereinheit desselben in einer Bakterienzetle codiert und die folgende 
Sequenz oder eine Untereinheit derselben umfaSt: 

is 

codierend: 5' AATTCCGATAGCG AGTGTCCTCT- 
nicht codierend: 3'TTAAGGCTATCGCTCACAGGAGA- 
GAGTCACGATGGTTACTGTGTCTACATGACGG- 
CTCAGTGCTACCAATGACAGATGTACTGCC- 

20 CGTCTGTATGTATATTGAGGCTCTAGACAA- 
GCAGACATACATATAACTCCGAGATCTGTT- 
GTACGCGTGTAATTGCGTTGTTGGCTACAT- 
CATGCGCACATTAACGCAACAACCGATGTA- 
CGGTGAGCGCTGTCAGTATCGAGATCTGAA- 

25 GCCACTCGCGACAGTCATAGCTCTAGACTT- 
ATGGTGGGAACTTAGA 
TACCACCCTTGAATCT 

2. Synthetisches Gen nach Anspruch 1, dadurch gekennzeichnet, dalS es die folgende Sequenz oder 
30 eine Untereinheit derselben umfaftt: 

codierend: 5' AGCTTAAAAAG AATTCCG ATAG- 
nicht codierend: 3'ATTTTTCTTAAG GCTATC- 
CGAGTGTCCTCTGAGTCACGATG GTTACT- 

35 GCTCACAGGAGACTCAGTGCTACCAATGA- 
GTCTACATGACGGCGTCTGTATGTAT- 
CAGATGTACTGCCGCAGACATACATATAA- 
GAGGCTCTAGACAAGTACGCGTGTAATTG- 
CTCCGAGATCTGTTCATGCGCACATTAAC- 

40 CGTTGTTGGCTACATCG GTGAGCGCTGTC- 

GCAACAACCGATGTAGCCACTCGCGACAG- 
AGTATCGAGATCTGAAATGGTGGGAACTT- 
TCATAGCTCTAGACTTTACCACCCTTGAA- 
A G A T A A G 
TCTATTCCTAG 

3. Verfahren zur Erzeugung eines synthetischen Gens oder einer Untereinheit desselben nach 
Anspruch 1 oder Anspruch 2, dadurch gekennzeichnet, date es den Zusammenbau und die Verknupfung 
einer Anzah! von Oligonucleotid-Blocken umfaSt 

50 4. Rekombinantes Plasmid, dadurch gekennzeichnet daS es einen Plasmid-Vektor umfafit, der ein 
synthetisches Gen oder eine Unterhert desselben nach Anspruch 1 oder Anspruch 2, eingefugt an einer 
geeigneten Insertionsstelle, aufweist wobei das rekombinante Plasmid die Translation in der korrekten 
Phase fur die dem eingefugten Gen oder der etngefugten Untereinheit desselben entsprechende mRNA 
ermoglicht und einen soichen Bakterien-Promotor von (stromaufwarts von) der Insertionsstelle und dieser 

55 benachbart aufweist, dafc das eingefugte Gen oder die Untereinheit desselben der Steuerung durch den 
Bakterien-Promotor unterliegen. 

5. Rekombinantes Plasmid nach Anspruch 4, dadurch gekennzeichnet, daS es als den Plasmid-Vektor 
pLFt enthalt. 

6. Verfahren zur Erzeugung eines rekombtnanten Plasmids nach Anspruch 4, dadurch gekennzeichnet, 
so daS es das Einfugen eines synthetischen Gens oder einer Untereinheit desselben nach Anspruch 1 oder 

Anspruch 2 an einer geeigneten Insertionsstelle eines geeigneten Plasmid-Vektors umfafit. 

7. Bakterienzelle, dadurch gekennzeichnet, dafc sie darin eingefugt ein rekombinantes Plasmid nach 
Anspruch 4 oder Anspruch 5 enthalt 

8. Verfahren zur Erzeugung einer Bakterienzelle nach Anspruch 7, dadurch gekennzeichnet, daS es das 
65 Einfugen eines rekombinanten Plasmids nach Anspruch 4 oder Anspruch 5 in eine Bakterienzelle umfaSt. 



7 



0 046 039 

9. Verfahren zur Erzeugung von Urogastron oder eines Aquivalents desselben oder einer Untereinheit 
desselben, dadurch gekennzeichnet, daS eine Bakterienzelle nach Anspruch 7 kultiviert wird und das 
exprimierte Protein isoliert wird. 

10. Fusioniertes Polypeptid, dadurch gekennzeichnet, daS es Urogastron oder ein Aquivalent 
5 desseiben oder eine Untereinheit desselben kovalent gebunden mit dem gesamten Gen oder einem Teil 

des Gens des Tryptophan-Operons umfafct und daS es durch Kultivieren einer Bakterienzelle nach 
Anspruch 7 und Isolierung des exprimierten Proteins erhalten wird. 

11. Fusioniertes Polypeptid nach Anspruch 10, dadurch gekennzeichnet, daB das Gen des Tryptophan- 
Operons das trp-E-Gen ist 

w 

Revendications 

1. Gene synthetique caracterise en ce qu'il code pour I'expression d'urogastrone ou un equivalent de * 
is cette derniere ou une sous-unite de cette derniere dans une cellule bacterienne et comprend la sequence 

suivante ou une sous-unite de cette derniere: 

Codant: 5' AATTCCG ATA6CG AGTGTCCTCT- 
Non-codant: 3'TTAAG GCTATCGCTCACAG GAGA- 
GAGTCACGATGGTTACTGTCTACATGACGG- 
CTCAGTGCTACCAATGACAGATGTACTGCC- 
CGTCTGTATGTATATTGAGGCTCTAGACAA- 
GCAGACATACATATAACTCCGAGATCTGTT- 
GTACGCGTGTAATTGCGTTGTTGGCTACAT- 
CATGCGCACATTAACGCAACAACCGATGTA- 
CGGTGAGCGCTGTCAGTATCGAGATCTGAA- 
GCCACTCGCGACAGTCATAGCTCTAGACTT- 
ATGGTGGGAACTTAGA 
TACCACCCTTGAATCT 

30 

2. Gene synthetique selon la revendication 1, caracterise en ce qu'il comprend la sequence suivante ou 
une sous-unite de cette derniere: 

Codant: 5'AGCTTAAAAAG AATTCCGATAG- 
Non-codant: 3'ATTTTTCTTAAG GCTATC- 
CGAGTGTCCTCTGAGTCACGATGGTTACT- 
GCTCACAGGAGACTCAGT G C TACCAATGA- 
GTCTACATGACGGCGTCTGTATGTATATT- 
CAGATGTACTGCCGCAGACATACATATAA- 
GAGGCTCTAGACAAGTACGCGTGTAATTG- 
CTCCGAGATCTGTTCATGCGCACATTAAC- 
CGTTGTTGGCTACATCGGTGAGCGCTGTC- 
GCAACAACCGATGTAGCCACTCGCGACAG- 
AGTATCGAGATCTGAAATGGTG GGAACTT- 
TCATAGCTCTAGACTTTACCACCCTTGAA- 
A G A T A A G 
TCTATTCCTAG 

3. Procide de production d'un gene synthetique ou d'une sous-unite de ce dernier selon la 
so revendication 1 ou la revendication 2, caracterise en ce qu'il comprend ('assemblage et la liaison d'un 

certain nombre de fragments d'oligonucleotides* 

4. Plasmide recombinant caracterise en ce qu'il comprend un plasmide yecteur dans lequel est insere 
en un site dlnsertion approprie un gene synthetique ou une sous-unite de ce dernier selon la revendication 
1 ou la revendication 2 f le plasmide recombinant permettant la traduction en la phase correcte pour 

55 I'ARN-m correspondent au gene insere ou a la sous-unite de ce dernier et ayant un promoteur bacterien en 
amont et adjacent par rapport au site d'insertion, tel que le gene insert ou la sous-unite de ce dernier se 
trouve sous le controte du promoteur bacterien. 

5. Plasmide recombinant selon la revendication 4 r caracterise^ en ce qu'il comprend en tant que 
plasmide vecteur pLFI. 

so 6. Precede de production d'un plasmide recombinant selon la revendication 4, caracterise en ce qu'il 
comprend ('insertion d'un gene synthetique ou d'une sous-unite de ce dernier selon la revendication 1 ou la 
revendication 2 dans un site d'insertion approprie d'un plasmide vecteur approprie. 

7. Cellulose bacterienne caracterisee en ce que s'y trouve insere un plasmide recombinant selon la 
revendication 4 ou la revendication 5. 

65 8. Procede de production d'une cellule bacterienne selon la revendication 7, caracterise en ce qu'il 
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comprend I'insertion d'un plasmide recombinant selon la revendication 4 ou la revendication 5 dans une 
cellule bacterienne. 

9. Procede de production d'urogastrone ou d'un equivalent de cette derniere ou d'une sous-unite de cette 
derniere, caracterise en ce qu'il comprend la culture d'une cellule bacterienne selon la revendication 7 et la 

5 recuperation de la proteine exprimee. 

10. Polypeptide fusionne caracterise en ce qu'il comprend I'urogastrone ou un equivalent de cette 
derniere ou une sous-unite de cette derniere liee a tout ou partie d'un gene de I'operon tryptophane et en ce 
qu'il est obtenu en cultivant une cellule bacterienne selon la revendication 7 et en recuperant la proteine 
exprimee. 

to 11. Polypeptide fusionne selon ia revendication 10, caracterise en ce que le gene de I'operon 
tryptophane est le gene trp E. 
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-10 -5 
Met Gin Thr Gin Lys Pro Thr Pro Ser Ser Lvs 
coding ATGCAAACACAAAAACCGACTCCAAGCTCCAAG 



Non-coding T T C 



1 5 
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu 
CTTAAAAAGAATTCCGATAGCGAGTfeTCCTCTG 



GAATTTTTCTTAAGGCTATCGCTCACAGGAGAC 



10 15 
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 
AGTCACGATGGTTACTGTCTACATGACGGCGTC 



TCAGTGCTACCAATGACAGATGTACTGCCGCAG 



20 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATTGAGGCTCTAGACAAGTACGCG 



ACATACATATAACTCCGAGATCTGTTCATGCGC 



Fjg . 1 (Part lof 2) 
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35 ^0 
Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg 
TGTAATT6CGTTGTTGGCTACATCGGTGAGCGC 



ACATTAACGCAACAACCGATGTAGCCACTCGCG 



Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu 
TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 



ACAGTCATAGCTCTAGACTTTACCACCCTTGAA 



55 

Arg Ter Gly Ser 
AGATAAGGATCC coding 



TCTATTCCTAGG Non-coding 



Fjg.1 (Part 2 of 2) 
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Hind 111 

-10 -5 
Met Gin Thr Gtn Lys Pro Thr Pro Ser Ser Lys 

ATGCAAACACAAAAACCGACTCCAAGCTCCAAG 

TT C 

• » t » • 



• « t 



EcoRI Mnl I 

1 5 



Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu 

:ttaaaaagaattccgatagcgagtgtcctctg 
r 



c 

t * » 



gaatttttcttaaggctatcgctcacaggagac 

»» » 



Hint I AccI Hgal 

10 15 
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 
AGTCACGATGGTTACTGT.CTACATGACGGCGTC 

—I " 



TCACTGCTACCAATGACAGAT GT»AC T GCCGCAG 
""'I* 1*** Hgal 



Mnll Xbal FnuDII 

20 • 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATTGAGGCTCTAGACAAGTACGCG 

I 



ACATACATATAACTCCGAGATCTGTTCATGCGC 

Hhal 



Fjg. 2 (Part lof 2) 
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HphI Haell 
35 40 
Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg 
TGTAATTGCGTTGTTGGCT AC A T C GG T G AG C G C 

"I 

ACATTAACGCAACAACCGATGTAGCCACTCGCG 

Mbol 
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Description 

This invention relates to a synthetic urogastrone gene, to corresponding plasmid recombinants and 
transformed cells, to the production thereof and to urogastrone expression. 

Urogastrone is a polypeptide hormone (protein) synthesised in the duodenum and in the salivary - 
glands of normal humans, (see, for example, Hertz et al. (1978), GUT, 19 f 408-413). Urogastrone 
5 suppresses the secretion of gastric acid and promotes cell growth (see, for example, Elder et al. (1975), 
Gut 16, 887-893). Therefore, it has an application in the treatment of ulcers and in the promotion of 
wound healing. Urogastrone is excreted in small amounts in human urine and may be isolated therefrom. 
There exists, however, a need for a more viable commercial production thereof and such is provided 
according to the present invention. 
10 Urogastrone is known to consist of 53 amino acids in the following sequence : 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp- 
Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr- 
lle Glu Ala Leu Asp Lys Tyr Ala Cys Asn Cys- 
15 Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr- 

Arg Asp Leu Lys Trp Trp Glu Leu Arg 

(see, for example, Gregory H. and Preston B. M. (1977), Int J. Peptide Protein Res., 9, 107-118). 

From the above amino acid sequence, a corresponding synthetic gene sequence has been invented, 

20 subject to a number of specific non-obvious criteria, and oligonucleotide blocks synthesised which, when 
assembled,, form a synthetic gene coding for urogastrone. The blocks have been hybridised and Eigated in 
pre-determined stages to construct the urogastrone gene in two portions. These have been cloned in two 
operations into a new specifically-designed chimeric E. coli/S. aureus vector so as to produce a full length 
urogastrone gene flanked only by E. coli plasmid DNA. The gene has been excised from this recombinant 

25 and re-cloned into vectors specifically designed to maximise expression of the gene in E. coli, under the 
control of the promoter obtained from the E. coli tryptophan operon. A protein resembling human 
urogastrone has thus been expressed in E. coli. 

From the above amino acid sequence, because of the degeneracy of the genetic code, it is possible to 
predict numerous nucleotide sequences which would code for the protein. 

30 In the inventive determination of an optimum sequence from the large number of possibilities, several 
non-obvious criteria have been observed. Firstly, trinucleotide codons should be used which are 
acceptable or preferable in the cells to be used, in particular E coli. Secondly, it was decided that it was 
desirable to have different restriction enzyme recognition sites at the termini of the molecule so as to 
allow insertion into a plasmid in a desired orientation. Moreover, it was decided to select sites which 

35 allowed the use of well-understood cloning vectors, such as pBR322 (see, for example, Bolivar F. et al. 
(1977), Gene, 2, 95-113). In fact, Hind III and Bam HI sites were selected and introduced at the 5' and 3' 
ends, respectively. Thirdly, it was thought desirable to introduce a series of restriction endonudease 
recognition sites strategically placed along the molecule to enable the gene to be specifically disected to 
aid characterisation and, possibly mutagenesis. Also, this measure allowed the two portions of the 

40 molecule to be cloned in stages. In particular, an Xba I site was introduced at a central location in the 
gene. Fourthly, the synthesis should not be unnecessarily complicated and illegitimate cross-hybridis- 
ations should be minimised in order to facilitate gene assembly. Using a computer, energies of 
interactions arising from ail possible approximations were calculated (see, for example, Tinoco Jr. I et al. 
(1971), Nature; 230, 362-367; Powers G. J. et al. (1975), JACS, 97, 875-899, so that stable off-diagonal 

45 interactions might be avoided where possible. Frfthly, since the protein ultimately expressed in bacterial 
cells will be in the form of a fusion product, it was desirable to have a means of cleaving the urogastrone 
portion from such a fusion product Since urogastrone is known to be insensitive to trypsin (see, for 
example, Gregory and Preston, loc. cit), the codons specifying the dipeptide lys-lys were introduced near 
the end of the gene corresponding to the urogastron N-terminus in order to serve as a substrate for tryptic 

50 proteolysis. 

The present invention relates to a synthetic gene characterised in that it codes for the expression of 
urogastrone or an equivalent thereof or a sub-unit thereof in a bacterial cell and comprises the following 
sequence or sub-unit thereof : 

55 5'AATTCCGATAGCGAGTGTCCTCTG- 
3'TTAAGGCTATCGCTCACAGGAGAC- 
AGTCACGATGGTTACTGTCTACATGACGG- 
TCAGTGCTACCAATGACAGATGTACTG C C- 
C G T C T G T A T G T A T A T T G A G G C T C T A G A C A- 

60 G C A G A C A T A C A T A T A A C T C C G A G A T C T G T- 

AGTACGCGTGTAATTGCGTTGTTGGCTAC- 
TCATGCGCACATTAACGCAACAACCGATG- 
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A T C G G T G A G C G C T G T C A G T A T C G A G A T C T- 
TAGCCACTCGCGACAGTCATAGCTCTAGA- 
GAAATGGTGGGAACTTAGA 
CTTTACCACCCTTGAATCT 



In fact, bearing in mind, inter alia, the above-mentioned considerations, for convenience, a slightly 
longer sequence was selected which is as follows : 

10 

Coding 5' AGCTTAAAAAG AATTCCG ATAGCGAGT^ 
Non-Coding 3''A TTTTTCTTAAGG CTATCGCTCA- 
GTCCTCTGAGTCACGATGGTTACTGTCT- 

1S CAGGAGACTCAGTGCTACCAATGACAGA- 
ACATGACGGCGTCTGTATGTATATTGAG- 
T G T A C T G C C G C A G A C A T A C A T A T A A C T C- 
GCTCTAGACAAGTACGCGTGTAATTGCG- 
CGAGATCTGTTCATGCGCACATTAACGC- 

2Q TTGTTGGCTACATCGGTGAGCGCTGT- CA- 

AACAACCGATGTAGCCACTCGCGACAGT- 
GTATCGAGATCTGAAATGGTGGGAACTT- 
CATAGCTCTAGACTTTACCACCCTTGAA. 
A G A T A A G 
T C T A.T T C C T A G 



The whole of this expanded sequence is shown in Figure 1 of the accompanying drawings and the 
restriction enzyme recognition sites are shown in Figure 2 of the accompanying drawings. 

The present invention also relates to a process for the production of the above said synthetic genes or 
30 sub-units thereof characterised in that it comprises the assembly and ligation of a number of 
oligonucleotide blocks. 

Such « blocks *> should not be regarded as sub-units of the gene. In the present context « sub-unit » 
refers to a sequence which is less than the whole, but which exhibits the desired properties. 

ft was in feet decided to synthesise a molecule having the above expanded sequence by making 23 
35 synthetic oligonucleotide blocks as illustrated in Figure 3 of the accompanying drawings, which will 
assemble by single-strand overlaps to give the complete double-stranded nucleotide sequence. 

In order to minimise strong illegitimate interactions near the centre of the molecule during assembly, 
certain blocks of modified sequence have also been synthesised, as illustrated in Figure 4 of the 
accompanying drawings. 

40 As mentioned above, the synthetic blocks selected are shown in Figure 3 of the accompanying 
drawings. The blocks may be constructed using known synthesis techniques (see, for example, AgarwaJ, 
et al. (1970), Nature, 227, 27-34 ; and Crea et ai. (1978), Proc. NatL Acad Sd. U.S.A., 75, 5765-5769). . 

The synthetic methods will now be illustrated with reference to the synthesis of the tetradecanuc- 
leotide ApGpTpTpCpCpCpApCpCpApTpTpT. 

45 The methods of building up oligonucleotides from smaller units by successive coupling reactions are 
well known (see, for example, Hsiung et al. (1979), Nucleic Acid Research, 6, 1371-1385). The completely 
protected tetradecanucleotide was built up as shown in Rgure 5 of the accompanying drawings (wherein, 
for convenience, protecting groups are not shown). 

The condensation reactions indicated by arrows in Rgure 5 were carried out by the following 

50 procedure exemplified in the synthesis of : 



DMTr A te p G ,Bu p-CNEt 
55 <j>Cl <|>Ci 



as shown in Figure 6 of the accompanying drawings. From 1.1 to 1.5 mmole of the 3'-phosphodiester 
component (I) was condensed with 1.0 mmole of the S'-hydroxyl component (II) in anhydrous pyridine in 

60 the presence of from 3 to 4.5 mmoies of 2,4,6-tnTsopropyl-benzene-sulphonyi-tetrazoHde. The reaction 
was left for 1 hour at room temperature or until chromatography on silica TLC plates eluted with 10 % (v/v) 
methanol in chloroform showed that the hydroxyl component was exhausted. The reaction was quenched 
with 5% (w/v) sodium bicarbonate solution and extracted with chloroform. The chloroform extract was 
dried and loaded onto a reverse phase chromatography column (ODS bonded to 15-25 micron silica). The 

65 fully protected dinucleotide product (111) was eluted with a solvent gradient from chloroform : methanol : 



4 
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water (2 : 6 : 3 v/v) to chloroform : methanol : water (2:6: 0.5 v/v). The product (III) was extracted into 
chloroform and dried. The final isolated yield was 81 %. 

To proceed to further condensations, the terminal protecting group (DMTr or CNEt) was removed 
selectively using triethylamine in pyridine (CNEt) or a 2% (w/v) solution of benzene sulphonic acid in 

5 chloroform : methanol (DMTr) as shown in Figure 7 of the accompanying drawings. 

At the completion of the synthesis, all of the protecting groups were removed by sequential treatment 
with 0.1 M tetraethyiammonium fluoride in THF/pyridine/water (8:1:1 v/v), ammonia and 80 % acetic 
add. The deprotected oligonucleotides were purified by ion-exchange HPLC and sequence analysis was 
carried out by the method of Wu et al. (1976), Anal. Biochem., 74, 73-93. 

10 The oligomeric blocks of nucleotides were hybridised and ligated (see, for example, Agarwal et al., 
loc. cit) in a series of steps, in order to minimise the possibilities for undesirable interactions, leading to 
the formation of the two portions as shown in Figure 8 of the accompanying drawings. The order of the 
additions in the assembly scheme was optimised for minimal incorrect ligations and in the case of 
especially difficult oligomeric blocks, notably 7 and 8, sub-molar quantities were used in order to remove 

15 ail monomeric units before further additions were made. 
In more detail : 

Left-hand portion : Blocks 1 and 2 were iigated to form a dimer about the Hind III site. Blocks 4, 5 and 
6 and 8, 9 and 10 were also ligated in the first round of the assembly scheme. Molar equivalents were used 
for all but block 8 where 0.75 molar equivalents were employed. 
20 Block 3 was ligated with 1 + 2, block 7 (0.75 molar equivalent) with 4 to 6 and block 11 with 8 to 10. 
The 8 + 9 + 10 + 11 assembly has one flush end, hence some blunt-end dimerisation was observed. 

1 to 3 and 4 to 7 were ligated and finally 8 to 11 were ligated to the resulting 1 to 7 species. 

The dimeric 1 to 11 left-hand portion was then cleaved by Hind ill (EC 3.1.23.21) and Xba I (EC 
3.1.23.4) to generate the monomeric left-hand portion, with the correct cohesive termini to allow 
25 construction of recombinant piasmids. 

Right-hand portion : Blocks 12 and 13 were ligated to form a dimer about the Xba I site and blocks 20, 
23 and 22 similarly ligated to form a dimer about the Bam HI site. Blocks 14, 15 and 17, and 16, 18 and 19 
were also ligated at this stage. 

The 12, 13 dimer and the 14, 15, 17 assembly were ligated, as were the 16, 18, 19 assembly with the 20, 
30 22, 23 dimer, where block 21 was used as a joining section. 

These two species were then iigated to give an oligomeric molecule which was cleaved by Xba I and 
Bam Hi (EC 3.1.23.6) to give the monomer 12 to 23 species having the correct cohesive termini to allow 
construction of recombinant plasmids. 

The present invention further relates to a plasmid recombinant characterised in that it comprises a 
35 plasmid vector having inserted therein at an appropriate insertion site a synthetic gene or a sub-unit 
thereof as defined above, the plasmid recombinant enabling translation in the correct phase for the 
mRNA corresponding to the inserted gene or sub-unit thereof and having a bacterial promoter upstream 
of and adjacent to the insertion site such that the inserted gene or sub-unit thereof is under bacterial 
promoter control. 

40 A particular vector, designated pLFl, has been specifically designed to afford advantages for the 
purposes of the present invention. The inventive plasmid pLFl, is a 5K bp plasmid which may be 
propagated in E coli and which may be constructed from pBR322 and pUB110 (see, for example, 
Gryczan, T. J. et al. (1978), J. Bacterial., 134, 318) by inserting the DNA sequence of the S. aureus plasmid 
pUB110 between the EcoRI and Bam HI sites thereof (comprising approximately 870 bp) between the 

45 EcoRI and Bam HI sites of pBR322, thereby replacing that region of pBR322. 

The present invention further relates to a process for the production of such a plasmid recombinant 
characterised in that it comprises inserting such a synthetic gene or a sub-unit thereof as defined above 
into an appropriate insertion site of an appropriate plasmid vector. 
The following illustrates the present invention : 

50 Phosphorylation of oligonucleotide blocks : In each case, 6 jjuj of oligomer and 60 |juCi of [r 32 ?] ATP 
(> 5 000 Ci/mMol) were dried and redissolved in a final buffer concentration of 50 mM Tris-HCI, pH 7.8, 10 
mM Mgd 2 , 0.25 mM ATP, 10 mM 0-mercaptoethanol and incubated with 4 units (1 unit is the amount that 
catalyses the production of 1 n mole of acid-insoluble ^P after incubation for 30 minutes at 37 °C 
according to Richardson, C.C., (1972), Progress in Nucleic Acids Research, 2, 815) of T4 polynucleotide 

55 kinase (EC 2.7.1.78, Bethesda Research Labs) at 37 °C for 15 minutes. The enzyme was subsequently 
inactivated by a 5 minute incubation at 100 °C. 

Ligation of oligonucleotide blocks : Except where indicated above, equimolar quantities (6 to 18 \sjq) 
of oligonucleotide blocks were incubated in 50 mM Tris-HCI, pH 7.8, 10 mM MgCI 2 , 1 mM ATP, 20 mM 
drthiothreitol (DTT) with 6 units (1 unit is the amount that catalyses the conversion of 1 n mole of ^PPi 

60 into (a/p 32P)-ATP in 20 minutes at 37 °C according to Weiss B. et al. (1968), J. Biol. Chem., 243 4543) of T4 
DNA Rgase (EC 6.5.1.1, Bethesda Research Labs) at 25 °C for from 3 to 16 hours. Ligated DNA was 
precipitated by addition of 2.5 vol absolute ethanol, collected by centrifugation and redissolved in water. 

Purification of ligated species : Ligated oligonucleotide blocks were electrophoresed on 20 % (w/v) 
potyacrylamide in 90 mM Tris-HCI, pH 8.3. 90 mM boric acid, 2.5 mM EDTA (TBE buffer), and the 

65 fragments located by autoradiography. Slices of gel containing the fragments were excised and the DNA 



5 
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electroeluted at 1 mA in TBE buffer onto 0.5 ml of DEAE cellulose (DE52, Whatman) for a few hours. After 
extensive washing by 0.1MNH 4 OAc, 2 mM Mg(OAc) 2 , 0.02 % (w/v) SDS, 0.02 mM EDTA (AGEB buffer), the 
DNA was eluted from the DEAE cellulose by 2 ml of 1.1 M NaCI in AGEB buffer and precipitated by 
addition of 2.5 vol absolute ethanol. 
5 Construction of chimeric cloning vector pLFl : In order to facilitate the two-stage cloning of 
urogastrone, it was desirable to construct a vector or vectors having Hind III, Xba I and Bam HI cleavage 
sites, see Rgure 9 of the accompanying drawings. However, no readily available E. coli plasmids possess 
Xba I sites. It was surprisingly noticed that the S. aureus plasmid pUBl10, which cannot be propagated in 
E. Coli (see, for example, Keggins K. M. et al. (1978), Proa Natl. Acad. Sci. U.S.A., 75, 1423), contains a 

10 region of about 1kbp of DNA bounded by sites for EcoRI and Bam HI having an approximately central Xba 
I site. Therefore, pUB110 was cleaved with EcoRI (EC 3.1.4.32) and Bam Hi and the DNA fragments 
electrophoresed on 5% (w/v) polyacrylamide. The approximately 1 kbp EcoRI/Bam HI fragment was 
removed by electroelution from the excised gel slice onto DEAE cellulose, eluted by 1.1 M NaCI and 
ethanol precipitated. The E coli plasmid pAT153 (see, for example, Twigg, A. J., and Sherratt D. (1980), 

15 Nature, 283, 216-218) was also cleaved with EcoRI and Bam HI, and the 3282 bp fragment purified by 
electroelution as above. These two purified DNA fragments were ligated in equimolar quantities using 10 
•units of T4 DNA ligase in 50 mM Tris-HCI, pH 7.8, 10 mM MgCfe, 1 mM ATP, 20 mM DTT at 15° for 
18 hours. After ethanol precipitation, the ligated DNA was dissolved in 100 jxl 10mM Tris pH 7.5, 10 mM 
MgCI 2 , 10 mM CaCfe and transformed into E. coli K12 HB101 (genotype gal~, lacf, ara-, pro-, arg~, 

20 str, rec A-, r k -. M k ~; see, for example, Boyer H. W; and Rouliard-Dussoix D„ J. MoL Biol., 41, 459- 
472) using known methods (see, for example, Cohen et al. (1972), Proc. Natl. Acad. Sci. U.SA, 69, 2110- 
2114) and transformants resistant to 100|xg/ml ampicillin selected. Several transformants were analysed 
further by restriction enzyme cleavage of plasmid DNA, and one full length clone, designated pLF1, 
selected for further use as a cloning vector. At a later stage, an additional Hind III restriction sequence was 

25 created at the EcoRI site.. 

Cloning of the synthetic urogastrone gene in pLF1 : The two portions of the assembled urogastrone 
gene were cloned in two transformation stages, as illustrated in Figure 10 of the accompanying drawings. 

Right-hand portion : The longer fragment of Xba t, Bam Hl-cieaved pLF1 was purified by 
electroelution as above and ligated to a large excess of the .right-hand portion assembly of the 

30 urogastrone gene using 10 units T4 DNA ligase in 50 mM Tris-HCI, pH 7.8, 10 mM MgCI 2 , 1 mM ATP, 20 
mM DTT at 15 °C for 18 hours. This was transformed into E. coli as before, with selection for 100 \sg/m\ 
ampicillin. Several transformants were selected for plasmid analysis by restriction enzyme cleavage 
analysis and one clone, designated pUR1 f used for further cloning. The sequence of the inserted 
urogastrone gene DNA was confirmed by chemical degradation analysis (see, for example, Maxam, A., 

35 and Gilbert, W., (1977), Proa Natl. Acad. Sci U.SA, 74, 560-4). 

In order to have a Hind III site for ligation to the 5' end of the left-hand portion of the gene, the EcoRI 
site was modified as follows : pUR1 was cleaved with EcoRI and the resulting recessed ends filled using 5 
units (1 unit is the amount that incorporates 10 n moles of total nucleotides into an acid-precipitable 
fraction in 30 minutes at 37 °C using poly-d(A-T) as primer according to Richardson, C.C. et aL (1964), J. 

40 Biol. Chem., 239, 222) of DNA polymerase I (EC 2.7.7.7) in 50 mM Tris-HCI, pH 7.5, 10 mM MgCI 2 , 1 mM p- 
mercaptoethanol with 0.25 mM of each deoxynucleotide for 30 minutes at 15 °C. 

To the resulting flush ends, were ligated a large excess of synthetic Hind III linkers, (Collaborative 
Research) by the known blunt end ligation procedure (see, for example, Ullrich A. et al. (1977), Science 
196, 1313-1319). The DNA was ligated using 6 units of T4 DNA ligase in 50 mM Tris-HCI, pH 7.8, 10 mM 

45 Mgd 2 , 1 mM ATP, 20 mM DTT at 15 °C for 18 hours. 

After Hind III restriction cleavage and purification, the full length plasmid was religated and 
transformed into E. coli as above, with selection for 100 yug/ml ampicillin. A transformant having a Hind III 
site, designated pUR2, was selected for further cloning. 

Left-hand portion : pUR2 was cleaved with Hind lit and Xba! restriction enzymes and the longer 

50 fragment purified by electroelution as above. This was ligated to a two-fold molar excess of the assembled 
left-hand portion of the urogastrone gene using 6 units T4 DNA ligase in 50 mM Tris-HCI, pH 7.8, 10 mM 
MgCI 2 , 1 mM ATP, 20 mM DTT for 18 hours at 15 °C. This was transformed into E. coli K12 MRC 8 
(genotype dap 103 hsd R met Bl glm 533 upp 1 dap 101 sup E thy A 103 deo rec A1) with selection for 
100jig/ml ampicillin. Several transformants were selected for plasmid analysis by restriction enzyme 

55 cleavage. One clone, designated pURl was used for further characterisation and cloning. The sequence 
of the full urogastrone gene was confirmed by chemical degradation analysis. It should be noted that 
pUR1 has no remaining S aureus DNA sequence present 

The present invention also relates to a bacterial cell, in particular an E. coli cell, characterised in that 
it comprises inserted therein a plasmid recombinant as defined above. 

60 The present invention further relates to a process for the production of such a bacterial cell 
characterised in that it comprises inserting "a plasmid recombinant as defined above into a bacterial ceil. 

Expression of urogastrone in E. coli : The urogastrone gene insert was cleaved from pUR1 by Hind III 
and Bam HI cleavage and purified by polyacrylamide gel electrophoresis and electroelution as above. This 
fragment was figated to Hind III, Bam Hl-cleaved pWT121 and pWT221 f (see. for example, Tacon, W.CA et 

65 aL (1980), Molec. Gen. Genet 177, 427) and the recombinant molecules used to transform E coli MRC 8, 
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(see for example, Emtage, J. S. et ai. (1980), Nature, 283, 171-174), with selection for 100 mj/ml ampiciiin. 
Transfdrmants containing full length urogastrone genes were characterised by restriction enzyme 
cleavage analysis and DNA was purified by isopycnic centrifugation in caesium chloride. 

Expression of urogastrone-like fusion protein was induced by growth of cells in L-broth (luria broth : 

5 1 % (w/v) bacto tryptone, 0.5 % (w/v) bacto yeast extract, 0.5 % (w/v) NaCI, 0.2 % (w/v) glucose, 0.004 % 
(w/v) thymine, pH 7) containing 100fig/ml ampicillin to an A600nm of 0.3. Following centrifugation, the 
cells were washed and resuspended in M9 medium lacking tryptophan, but containing 20 |ig/ml 3 p-indole 
acrylic acid. The cells were incubated at 37 °C for 4 hours. Under these conditions maximal tryptophan 
promoter activity is known to occur (see Tacon et al. loc cit), and hence expression- of the urogastrone 

10 fusion protein. 

The present invention also relates to a process for the production of urogastrone or an equivalent 
thereof or a sub-unit thereof characterised in that it comprises cutturing a bacterial cell as defined above 
and recovering expressed protein. 

The present invention further relates to a fused polypeptide characterised in that it comprises 
15 urogastrone or an equivalent thereof or a sub-unit thereof covalently bonded with ail or part of a gene of 
the tryptophan operon, e.g. trp E, and in that it is obtained by culturing a bacterial cell as defined above 
and recovering expressed protein. 

As mentioned above, urogastrone has an application in the treatment of ulcers, and also in other 
instances where the growth promoting activity thereof would be beneficial, for example, in wound 
20 healing. Conventional administration forms may be used, the active material being used in an effective 
amount, for example from 0.1 fo 1.0 jxg/kg body weight, preferably from 0.125 to 0.5 fig/kg body weight, 
more preferably about 0.25 pjQ/kg body weight, optimally together with a conventional * pharmaceuticaily- 
acceptable carrier, diluent or adjuvant 

25 

Claims 

1. A synthetic gene characterised in that it codes for the expression of urogastrone or an equivalent 
thereof or a sub-unit thereof in a bacterial cell and comprises the following sequence or a sub-unit 
30 thereof : 

Coding: 5'AATTCCGATAGCGA G T G T C C T C T- 
Non-coding: 3' TTAAGGCTATCGCTCACAG GAGA- 
GAGTCACGATGGTTACTGTCTACATGACGG- 
CTCAGTGCTACCAATGACAGATGTACTGCC- 
CGTCTGTATGTATATTGAG G CTCTAGACAA- 
GCAGACATACATATAACTCCGAGATCTGTT- 
GTACGCGTGTAATTGOGTTGTTGGCTACAT- 
CATGCGCACATTAACGCAACAACC G* A T G T A- 
CGGTGAGCGCTGTCAGTATCGAGATCTGAA- 
GCCACTCGCGACAGTCATAGCTCTAGACTT- 
ATGGTGGGAACTTAGA 
TACCACCCTTG AATCT 

45 2. A synthetic gene as claimed in Claim 1 characterised in that it comprises the following sequence or 
a sub-unit thereof : 

. Coding: 5' AGCTTAAAAAG AATTCCG ATAG- - 
Non-coding: 3'ATTTTTCTTAAGG CTATC-' 

50 CGAGTGTCCTCTGAGTCACGATGGTTACT- 
GCTCACAGGAGACTCAGTGCTACCAATGA- 
GTCTACATGACGGCGTCTGTATGTATATT- 
CAGATGTACTGCCGCAGACATACATATAA- 
GAGGCTCTAGACAAGTACGCGTGTAATTG- 

55 CTCCGAGATCTGTTCATGC-GCACATTAAC- 
CGTTGTTGGCTACATCGGTGAGCGCTGTC- 
GCAACAACCGATGTAGCCACTCGCGACAG- 
AGTATCGAGATCTGAAATG GTGGGAACTT- 
TCATAG CTCTAGACTTTACCACC-CTTG A A- 

&> AGATAAG 

TCTATTCCTAG 

3. A process for the production of a synthetic gene or a sub-unit thereof as daimed in Claim 1 or 
Claim 2 characterised in that it comprises the assembly and ligation of a number of oligoncleotide blocks. 
65 4. A plasmid recombinant characterised in that it comprises a plasmid vector having inserted therein 
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at an appropriate insertion site a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2, the 
plasmid recombinant enabling translation in the correct phase for the mRNA corresponding to the 
inserted gene or sub-unit thereof and having a bacterial promoter upstream of and adjacent to the 
insertion site such that the inserted gene or sub-unit thereof is under bacterial promoter control. 
5 5. A piasmid recombinant as claimed in Claim 4 characterised in that it comprises as the plasmid 
vector pLFL • 

6. A process for the production of a piasmid recombinant as claimed in Claim 4 characterised in that 
it comprises inserting a synthetic gene or a sub-unit thereof as claimed in Claim 1 or Claim 2 into an 
appropriate insertion site of an appropriate plasmid vector. 
10 7. A bacterial cell characterised in that it comprises inserted therein a piasmid recombinant as 
claimed in Claim 4 or Claim 5. 

8. A process for the production of a bacteria] cell as claimed in Claim 7 characterised in that it 
comprises inserting a plasmid recombinant as claimed in Claim 4 or Claim 5 into a bacterial celL 

9. A process for the production of urogastrone or an equivalent thereof or a sub-unit thereof 
15 characterised in that it comprises culturing a bacterial cell as claimed in Claim 7 and recovering 

expressed protein. 

10. A fused polypeptide characterised in that it comprises urogastrone or an equivalent thereof or a 
sub-unit thereof covalently bonded with all or part of a gene of the tryptophan operon and in that it is 
obtained by culturing a bacterial cell as claimed in Claim 7 and recovering expressed protein. 

20 11. A fused polypeptide as claimed in Claim 10 wherein the gene of the tryptophan operon is the trp E 
gene. 



Patentanspruche 

25 

1. Synthetisches Gen, dadurch gekennzeichnet, daB es die Expression fur Urogastron Oder ein 
AquivaJent desselben oder eine Untereinheit desselben in einer Bakterienzelle codiert und die folgende 
Sequenz oder eine Untereinheit derselben umfaBt : 

30 codierend :5'AATTCCGATAGCGAGTGTCCTCT- 

nicht cotfierend :3' TTAAGGCTATCGCTCACAGGAGA- 
GAGTCACGATGGTTACTGTCTACATGACGG- 
CTCAGTGCTACCAATGACAGATGTACTGCC- 

CGTCTGTATGTATATTGAGGCTCTAGACAA-* 
35 GCAG ACATACATATAACTCC6 AGATCTGTT- 

GTACGCGTGTAATTGCGTTGTTGGCTACAT- 
C A T G C G C AC ATTAACGCAACA AC C G A T G T A- 
CGGTG AGCGCTGTCAGTATCGAG ATCTG AA- 
GCCACTCGCGACAGTCATAGCTCTAGACTT- 
40 ATGGTGGGAACTTAGA 
TACCACCCTTGAATCT 

2. Synthetisches Gen nach Anspruch 1, dadurch gekennzeichnet, daB es die folgende Sequenz oder 
eine Untereinheit derselben umfaBt : 

45 

codierend :5' AGCTTAAAAAG AATTGCG ATAG- - 
nicht codierend :3'ATTTTTCTTAAG G C T A T C- ' 

CGAGTGTCCTCTGAGTCACGATGGTTACT- 
GCTCACAGGAGACTCAGTGCTACCAATGA- 

50 GTCTACATGACGGCGTCTGTATGTATATT- 
CAGATGTACTGCCGCAGACATACATATAA- 
GAGGCTCTAGACAAGTACGCGTGTAATTG- 
CTCCG AG ATCTGTTCATG CG CACATTAAC* 
CGTTGTTGGCTACATCGGTG AGCGCTGTC- 

55 GCAACAACCGATGTAGCCACTCGCGACAG- 
AGTATCGAGATCTGAAATGGTGGGAACTT- 
TCATAGCTCTAG ACTTTACCACCCTTG AA- 
A G A T A A G 
TCTATTCCTAG 

60 

3. Verfahren zur Erzeugung eines synthetischen Gens oder einer Untereinheit desselben nach 
Anspruch 1 oder Anspruch 2, dadurch gekennzeichnet, daB es den Zusammenbau und die Verknupfung 
einer Anzahl von Oligonucleotid-Blocken umfaBt 

-4. Rekombinantes Plasmid, dadurch gekennzeichnet, daB es einen Plasmid- Vektor umfaBt, der ein 
55 synthetisches Gen oder eine Unterheit desselben nach Anspruch 1 oder Anspruch 2, eingefugt an einer 
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geeigneten Insertionsstelle, aufweist, wobei das rekombinante Plasmid die Translation in der korrekten 
Phase fur die dem eingefugten Gen oder der eingefugten Untereinheit desselben entsprechende mRNA 
ermoglicht und einen solchen Bakterien-Promotor von (strornaufwarts von) der Insertionsstelle und 
dieser benachbart aufweist, daB das eingefugte Gen oder die Untereinheit desselben der Steuerung 
5 durch den Bakterien-Promotor unteriiegen. 

5. Rekombinantes Plasmid nach Anspruch 4, dadurch gekennzeichnet, daB es als den Plasmid- 
Vektor pLFI enthalt 

6. Verfahren zur Erzeugung eines rekombinarrten Piasmids nach Anspruch 4, dadurch gekennzeich- 
net, da8 es das Einfugen- eines synthetischen Gens oder einer Untereinheit desselben nach Anspruch 1 

10 oder Anspruch 2 an einer geeigneten Insertionsstelle eines geeigneten Plasmid- Vektors umfaBt 

7. Bakterienzelie, dadurch gekennzeichnet, da8 sie darin eingefugt ein rekombinantes Plasmid nach 
Anspruch 4 oder Anspruch 5 enthalt 

8. Verfahren zur Erzeugung einer Bakterienzelie nach Anspruch 7, dadurch gekennzeichnet, daB es 
das Einfugen eines rekombinarrten Piasmids nach Anspruch 4 oder Anspruch 5 in eine Bakterienzelie 

15 umfaBt 

9. Verfahren zur Erzeugung von Urogastron oder eines Aquivalents desselben oder einer Unterein- 
heit desselben, dadurch gekennzeichnet daB eine Bakterienzelie nach Anspruch 7 kultiviert wird und das 
exprimierte Protein isoliert wird. 

10. Fusioniertes Polypeptid, dadurch gekennzeichnet, daB es Urogastron oder ein Aquivalent 
20 desselben oder eine Untereinheit desselben kovalent gebunden mit dem gesamten Gen oder einem Teil 

des Gens des Tryptophan-Operons umfaBt und daB es durch Kultivieren einer Bacterienzelle nach 
Anspruch 7 und Isolierung des exprimierten Proteins erhaiten wird. 

11. Fusioniertes Polypeptid nach Anspruch 10, dadurch gekennzeichnet, daB das Gen des Trypto- 
phan-Operons das trp-E-Gen ist 

25 



Revendications 

1. Gene synthetique caracteris§ en ce qu'il code pour Pexpression d'urogastrone ou un equivalent de 
30 cette derniere ou une sous-unite de cette derniere dans une cellule bacterfenne et comprend la sequence 
suivante ou une sous-unite de cette derniere : 

Codant: 5'AATTCCGATAGCGAGTGTCCTCT- 
Non-codant: 3' TTAAGGCTATC6CTCACAGGAGA- 

35 GAGTCACG ATGGTTACTGTCTACATGACGG- 

CTCAGTGCTACCAATGACAGATGTACTGCC- 
CGTCTGTATGTATATTGAG G CTCTAGACAA- 
GCAGACATACATATAACTCCGAGATCTGTT- 
GTACGCGTGTAATTGCGTTGTTGGCTACAT- 

40 CATGGGCACATTAACGCAACAACCGATGTA- 
CGGTGAGCGCTGTCAGTATCGAGATCTGAA- 
- GCCACTCG CGACAGTCATAGCTCTAGACTT- 
ATGGTGGG AACTTAGA 
TACCACCCTTG AATCT 



45 



2. G§ne synthetique selon la revendication 1 , caracteris6 en ce qull comprend la sequence suivante 
ou une sous-unite de cette derniere : 



Codant: 5' AGCTTAAAAAG AATTCCGATAG- 
50 Non-codant: 3'ATTTTTCTTAAGGCTATC- 

CGAGTGTCCTCTGAGTCACGATGGTTACT- 
GCTCACAGGAGACTCAGTGCTACCAATGA- 
GTCTACATGACGGCGTCTGTATGTATATT- 
CAGATGTACTGCCGCAGACATACATATAA- 
G A G G C T C T A G A C A A G T A C G C G T G T A A T T G- 
CTCCGAGATCTGTTCATGCGCACATTAAC- 
CGTTGTTG GCTACATCGGTGAGCGCTGTC- 
GCAACAACCGATGTAGCCACTCGCGACAG- 
AGTATCGAGATCTGAAATGGTGGGAACTT. 
TCATAGCTCTAGACTTTACCACCCTTGAA- 

A G A T A A G 
TCTATTCCTAG 



55 



60 



3. Procede de production d'un gene synthetique ou cfune sous-unite de ce dernier selon la 
65 revendication 1 ou la revendication 2, caracterise en ce qu'il comprend Passemblage et la liaison d'un 
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certain nombre de fragments ^oligonucleotides. 

4. Plasmide recombinant caracterise en ce qu'H comprend un piasmide vecteur dans lequel est 
insere en un site dlnsertion approprie un gene synth&ique ou une sous-unite de ce dernier selon la 
revendication 1 ou la revendication 2, le plasmide recombinant permettant la traduction en la phase 
5 correcte pour PARN-m correspondant au gene inse>§ ou a la sous-unite de ce dernier et ayant un 
promoteur bacterien en amont et adjacent par rapport au site dlnsertion, tel que le gene inse>6 ou la 
sous-unite" de ce dernier se trouve sous le contrdle du promoteur bacterien. 

. 5. Plasmide recombinant selon la revendication 4, caracterise en ce qu'il comprend en tant que 
plasmide vecteur pLFI. 

10 6. Proced§ de production d'un plasmide recombinant selon la revendication 4, caracterise en ce qu'il 
comprend Pinsertion d'un gene synth§tique ou d'une sous-unite de ce dernier selon la revendication 1 ou 
la revendication 2 dans un site dlnsertion approprie d'un plasmide vecteur approprie. 

7. Cellulose bacterienne caracterisee en ce que s'y trouve insere un plasmide recombinant selon la 
revendication 4 ou la revendication 5. 

15 8. Proced§ de production d'une cellule bacterienne selon la revendication 7, caracterise en ce qu'il 
comprend Tinsertion d'un plasmide recombinant selon la revendication 4 ou la revendication 5 dans une 
cellule bacterienne. 

9. Procede de production d'urogastrone ou d'un equivalent de cette demiere ou d'une sous-unite de 
cette demiere, caracterise en ce qu'il comprend la culture d'une cellule bacterienne selon la 

20 revendication 7 et la recuperation de la proteine exprimee. 

10. Polypeptide fusionn§ caracterise en ce qu'il comprend Purogastrone ou un equivalent de cette 
derniere ou une sous-unite de cette demiere liee a tout ou partie d'un gene de Poperon tryptophane et en 
ce qu'il est obtenu en cultivant une cellule bacterienne selon la revendication 7 et en recuperant la 
proteine exprimee. 

25 11. Polypeptide fusionne selon la revendication 10, caracterise en ce que le gene de Poperon 
tryptophane est le gene trp E 
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-10 -5 
Met Gin Thr Gin Lys Pro Thr Pro Ser Ser l_vs 
coding ATGCAAACACAAAAACCGACTCCAAGCTCCAAG 



Non-coding T T C 



1 5 
Leu Lys Lys Asn Ser Asp Ser Glu Cys Pro Leu 
CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG 



GAATTTTTCTTAAGGCTATCGCTCACAGGAGAC 



10 15 
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 
AGTCACGATGGTTACTGTCTACATGACGGCGTC 



TCAGTGCTACCAATGACAGATGTACTGCCGCAG 



20 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
T G T A T G.T ATATTGAGGCTCTAGACAAGTACGCG 



ACATACATATAACTCCGAGATCTGTTCATGCGC 



Fi g . 1 (Part 1 of 2) 
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35 4° 
1% t'gV't't g't T G^C T^C a'Vc G^T G^GC^C 

ACATTAAC6CAACAACCGATGTAGCCACTCGCG 

Cys Gin Tyr Arg Asp Leu Lys Trp Trp^ Gtu Leu 
TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 

ACAGTCATAGCTCTAGACTTTACCACCCTTGAA 
55 

Arg Ter Gly Ser 
AGATAAGGATCC coding 

TCTATTCCTAGG Non-coding 

Fig.1 (Part 2 of 2) 
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Hindlll 
5 A ' UI 

Met Gin Thr Gtn Lys Pro Thr Pro Ser Ser Lys 
ATGCAAACACAAAAACCGACTCCAAGCTCCAAG 



TT C 
'I 



f • * » • 
* • • I 



EcoRI ' Mn1I 

1 5 
Leu Lys Lys Asn Ser Asp Ser Gtu Cys Pro Leu 
CTTAAAAAGAATTCCGATAGCGAGTGTCCTCTG 



» # » 



GAATTTTTCTTAAGGCTATCGCTCACAGGAGAC 



Hinfl AccI Hgal 

10 15 
Ser His Asp Gly Tyr Cys Leu His Asp Gly Val 
AGTCACGATGGTTACTGT*CTACATGACGGCGTC 
—I — "" 



TCAGTGCTACCAATGACAGATG T»A CT GCCGCAG 
T I*'* Hgal 



Mn1I • Xbal FnuDII 
20 25 30 

Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATTGAGGCTCTAGACAAGTACGC'G 

I 



ACATACATATAACTCCGAGATCTGTTCATGCGC 

Hhal 



Fig. 2 (Part lof 2) 
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Hphl Haell 
35 40 



Cys Asn Cys Val Val Giy Tyr He Gly Glu Aro 
T G T A A T T G CGTTGTTGGCT ACATCG6TGAGC 6C 

I 



ACATTAACGCAACAACCGATGTAGCCACTCGCG 

♦++++♦+♦♦ ♦[•«•*• 

•I" 

Mbol 
TaqI Bglll 

45 50 
Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu 
T G T C+A GTATCGAGATCTGAAATGGT GGGAACTT 
•I I 

•I ** 

A C A+G TCATAGCTCTAGACTTTACCACCCTTGAA 
Hphl I I 
— I € 



BamHl 
Mbol 

Arg Ter Gly Ser 
AGAT AAGGATCC 
•I 

TCTATT CCTAGG 

r 

i 



Fig. 2 (Part 2 6f 2) 
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coding A G 



Non -coding — 



1 5 
Leu LyS lys Ash Ser Asp Ser Glu Cys__ Pro Leu 
CTTAAAAaGAATTCCG AT AGCGAGTGTCC TCTG 
-! 1" 

I (13) HI (18) 



-— ATTTT TCTT AAGGC TA.TCGCTC ACAGGAGAC 
j J 

11(15) IV (18) 



10 15 
Ser His Asp Gly Tyr Cys Leu His_ Asp, Glv Val 

agtcacgatggtttactgtctacatgacgg'cgtc 

V (18) VII (18) 

TCAGTGCTACCAATGACAGATGTACTGCCGCAG 

— I—" I 

VI (18) VIII (18) 



20 25 30 
Cys Met Tyr He Glu Ala Leu Asp Lys Tyr Ala 
TGTATGTATATTGAGGCTCTAGACAAGTACGCG 
1 -I 

IX (12) XI (12) I- 

XIII (18) 

ACATACATATAACTCCGAGATCTGTTCATGCGC 

X (18) I " 

XII (20) 



Fig. 3 (Parti of 2) 
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35 40 
Cys Asn Cys Val Val Gly Tyr He Gly Glu Arg 
TGTAATT GCGTTG TTGGCTACATCGGTGAGCGC 

1 "xv (16) XVII (13) 



ACATTAACGCAACAACC G A TGTAGCCACTCGCG 
1 1 

XII (20) XIV (16) XVI (12) 



45 50 
Cys Gin Tyr Arg Asp Leu • Lys Trp Trp Glu Leu 
TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 

1 1- 

XIX (15) XXI (12) XXIII (14) 

ACAGTCATAGCTCTAGACTTTACCACCCTTGAA 

I r — I - I- 

XVIII (14) XX (14) 



Arg Ter 

AGATAAG Coding 
XXIII OA)- 



I.5.IAI.I.™.I. A .? Non-coding 
XXII (12) 



Fjg.3 (Part2of2) 
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coding A G 
I— 



Non-coding - 



5 



C L f T »A A L fG aYt T^C <ftft A^G M Jg M 
1*03)" 1 I1I{18) 



..-•ATTTTTCTTAAGGCTATC6CTCACAGGA6AC 

1 nns) " I " IV (18) 



10 15 



» T «C G^T <R?T A'c flPf <W A C^T G^T^C 

Vos" l " ' V1I " (18) 

TCAGTGCTACCAATGACAGATGT A CTG CCA C_A G 

"" , 'vTm : " VIll * (18) 



25 



30 



£■ Met ivr He Glu Ala Leu Asd lys Tyr Ala 

T £V aTg t St a t c g a a gckt_agac a a g t a c g c g 

"~«"o2)- "" l ""v^2)""i-"- - x — - 

A C A T A C A T A T A_G_C TJ [ C G A G A _T CT G T T C A T G C G C 
1 X* 08) I" "" 



XII (20) 



Fig . 4 (Part 1 of 2) 
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35 40 
Cys Asn Cys Val Val Gly Tyr Ite Gly Glu Arg 
TGTAATTGCGTTGTTGGCTACATCGGTGAGCGC 

1 - I I- 

XV (16) XVII (13) 



ACATTAACGGAACAACCGATGTAGCCACTCGCG 

— I 1- 

XII (20) XIV (16) XVI (12) 



45 50 
Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu 
TGTCAGTATCGAGATCTGAAATGGTGGGAACTT 

— - I- I— - 

XIX (15) XXI (12) XXIII (14) 

ACAGTC ATAGCTC TAGACTT TACCACCCTT G A A 

1 - — I - 1- 

XVIII (14) XX (14) 



Arg Ter 

AGATAAG Coding 



XXI 1 1 (14) 



LlTAllS.S™.? Non-cod.ng 
XXII (12) 
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